Abstract

We introduce the cross-composition framework for proving kernelization lower bounds. A
classical problem L AND/OR-cross-composes into a parameterized problem Q if it is possible
to efficiently construct an instance of Q with polynomially bounded parameter value that
expresses the logical AND or OR of a sequence of instances of L. Building on work by
Bodlaender et al. (ICALP 2008) and using a result by Fortnow and Santhanam (STOC
2008) with a refinement by Dell and van Melkebeek (STOC 2010), we show that if an NP-hard
problem OR-cross-composes into a parameterized problem Q then Q does not admit a
polynomial kernel unless NP $\subseteq$ coNP/poly and the polynomial hierarchy collapses. Similarly,
an AND-cross-composition for Q rules out polynomial kernels for Q under Bodlaender et al.'s
AND-distillation conjecture.

Our technique generalizes and strengthens the recent techniques of using composition
algorithms and of transferring the lower bounds via polynomial parameter transformations.
We show its applicability by proving kernelization lower bounds for a number of
important graph problems with structural (non-standard) parameterizations, e.g., Clique,
Chromatic Number, Weighted Feedback Vertex Set, and Weighted Odd Cycle
Transversal do not admit polynomial kernels with respect to the vertex cover number
of the input graphs unless the polynomial hierarchy collapses, contrasting the fact that
these problems are trivially fixed-parameter tractable for this parameter. We have similar
lower bounds for Feedback Vertex Set and Odd Cycle Transversal under structural
parameterizations.

After learning of our results, several teams of authors have successfully applied the 
cross-composition framework to different parameterized problems. For completeness, our 
presentation of the framework includes several extensions based on this follow-up work. For 
example, we show how a relaxed version of OR-cross-compositions may be used to give lower 
bounds on the degree of the polynomial in the kernel size. 

1 Introduction 

Preprocessing and data reduction are important and widely applied concepts for speeding up 
polynomial-time algorithms, and for making computation feasible at all in the case of hard 
problems that are not believed to have efficient algorithms. Kernelization is a way of formalizing 
data reduction, which allows for a formal analysis of the (im)possibility of data reduction and 
preprocessing. It originated as a technique to obtain fixed-parameter tractable algorithms for 
hard (parameterized) problems, and has evolved into its own topic of research (see [32, 3]
for recent surveys). A parameterized problem [23] is a language $Q \subseteq \Sigma^* \times \mathbb{N}$;
the second component is called the parameter. Such a problem is (strongly uniformly)
fixed-parameter tractable (FPT) if there is a computable function $f$ such that membership of an
instance $(x,k)$ can be decided in $f(k) \cdot |x|^{O(1)}$ time. A kernelization algorithm (kernel) for $Q$
transforms an instance $(x,k)$ in time polynomial in $|x| + k$ into an equivalent instance $(x',k')$
such that $\max(|x'|, k') \le f(k)$ for some computable function $f$, which is the size of the kernel.

*This work was supported by the Netherlands Organization for Scientific Research (NWO), project
"KERNELS: Combinatorial Analysis of Data Reduction". A preliminary version appeared in the
proceedings of the 28th International Symposium on Theoretical Aspects of Computer Science
(STACS 2011). Several results have been strengthened compared to the preliminary version.

It is well-known that a decidable problem admits a kernel if and only if it is fixed-parameter
tractable (cf. [3]). From a practical perspective we are particularly interested in cases where the
kernel size $f$ is polynomially bounded in $k$, so-called polynomial kernels. Success stories of
kernelization include the $O(k^2)$ kernel for $k$-Vertex Cover containing at most $2k$ vertices [13, 1]
and the meta-theorems for kernelization of problems on topological graph classes [5, 28]. The
research community spent many years trying to find polynomial kernels for elusive problems
such as $k$-Path, before techniques were developed which made it possible to prove (under some
complexity-theoretic assumption) that a parameterized problem in FPT does not admit a
polynomial kernel. Bodlaender et al. [4] introduced the concept of an OR-composition algorithm as
a tool to give super-polynomial lower bounds on kernel sizes. Informally speaking (we defer
formal definitions to Section 2), an OR-composition algorithm combines a series of inputs of
a parameterized problem $Q$, all sharing the same parameter $k$, into a single instance $(x', k')$
of $Q$ such that $k'$ is polynomial in $k$, and $(x',k')$ acts as the logical OR of the inputs in the
sense that $(x',k')$ is a YES-instance if and only if (at least) one of the inputs is. They used
a theorem by Fortnow and Santhanam [29] to show that if there is an OR-composition for an
NP-hard parameterized problem $Q$, then $Q$ does not admit a polynomial kernelization unless
NP $\subseteq$ coNP/poly. This machinery made it possible to prove, e.g., that $k$-Path and the Clique
problem parameterized by the treewidth of the graph do not admit polynomial kernels unless
NP $\subseteq$ coNP/poly.^1 The latter condition is equivalent to coNP $\subseteq$ NP/poly, and is deemed
unlikely as it implies a collapse of the polynomial hierarchy to its third level [49] and further [11].

It did not take long before the techniques of Bodlaender et al. were combined with the notion
of a polynomial parameter transformation to also prove lower bounds for problems for which no
direct OR-composition algorithm could be found. This idea was used implicitly by Fernau et
al. [26] to show that $k$-Leaf Out-Branching does not admit a polynomial kernel, and was
formalized in a paper by Bodlaender et al. [10]: they showed that if there is a polynomial-time
transformation from $Q$ to $Q'$ which incurs only a polynomial blow-up in the parameter size, then
if $Q$ does not admit a polynomial kernel then $Q'$ does not admit one either. These polynomial
parameter transformations were used extensively by Dom et al. [22], who proved kernelization
lower bounds for a multitude of important parameterized problems such as Small Universe
Hitting Set and Small Universe Set Cover. Dell and van Melkebeek [21] were able to
extend the techniques of Fortnow and Santhanam to prove, e.g., that Vertex Cover does
not admit a kernel of bitsize $O(k^{2-\epsilon})$ for any $\epsilon > 0$. Follow-up work has resulted in lower
bounds on the degree of the polynomial in the kernel sizes of packing problems [35], using
composition-like techniques in which the output parameter is allowed to depend sublinearly on
the number of inputs. The complementary witness lemma of Dell and van Melkebeek [21] shows
that even co-nondeterministic compositions give kernel lower bounds;^2 this subsequently led
one of the current authors to leverage the power of co-nondeterminism to obtain a lower bound
for a Ramsey-type problem [39].

^1 In the remainder of this introduction we assume that NP $\not\subseteq$ coNP/poly when stating
kernelization lower bounds.

^2 In fact, this is already implicit in the results of Fortnow and Santhanam, as observed in
unpublished work of Chen and Müller.

Our contribution. We introduce the framework of cross-composition for obtaining
kernelization lower bounds, and give several applications to fundamental graph problems under
structural parameterizations. The technique generalizes and strengthens the earlier methods
of compositions [4] and polynomial parameter transformations [10], and puts the two existing
methods for obtaining kernelization lower bounds in a common perspective. Whereas the
existing notion of AND- or OR-composition works by composing multiple instances of a parameterized
problem $Q$ into a single instance of $Q$ with a bounded parameter value, in the cross-composition
framework it is sufficient to compose the AND or OR of any classical NP-hard problem into an
instance of the parameterized problem $Q$ for which we want to prove a lower bound. The term
cross in the name stems from this fact: the source and target problems of the composition need
no longer be the same. Since the input to a cross-composition algorithm is a list of classical
instances instead of parameterized instances, the inputs do not have a parameter in which the
output parameter of the composition must be bounded; instead we require that the size of the
output parameter is polynomially bounded in the size of the largest input instance. This makes
our extension also fruitful when the source and target problems of the composition coincide,
since it allows for a larger parameter value in the target instance than what would be permissible
in a "classic" composition. In addition we show that the output parameter for a composition of $t$
instances may depend polynomially on the logarithm of the number of input instances, which
often simplifies the constructions and proofs. We also introduce the concept of a polynomial
equivalence relation to remove the need for padding arguments which were frequently required
in compositions.

We exhibit the power of cross-composition by giving kernelization lower bounds for structural
parameterizations of several important graph problems. Since many combinatorial problems are
easy on graphs of bounded treewidth [9], and since the treewidth of a graph is bounded by the
vertex cover number, many problems are fixed-parameter tractable when parameterized by the
cardinality of a given vertex cover. We show that this tractability does not extend to
polynomial kernelizability: Clique, Chromatic Number, Weighted Feedback Vertex Set, and
Weighted Odd Cycle Transversal do not admit polynomial kernels under this
parameterization. In the case of Clique it was already known [4] that the problem does not admit a
polynomial kernel parameterized by the treewidth of the graph; since the vertex cover number is
at least as large as the treewidth we prove a stronger result. By taking the edge-complement of
the input graph, Clique parameterized by the size of a given vertex cover is FPT-equivalent to
Vertex Cover parameterized by the size of a given vertex set whose removal leaves a clique.
Hence we also establish a kernel lower bound for the latter problem, which marks an
interesting boundary in the search for the smallest parameterization of Vertex Cover admitting a
polynomial kernel [18, 36, 41]. We provide a polynomial parameter transformation to extend
the lower bound to Feedback Vertex Set and Odd Cycle Transversal parameterized
by the vertex-deletion distance to a clique, contrasting the known polynomial kernelizability of
these problems with respect to the natural parameterization by solution size [48, 42]. These
results strengthen lower bounds given in a preliminary version of this work, where we supplied
lower bounds for the smaller parameters "deletion distance to a cluster graph" and "deletion
distance to a co-cluster graph".

Since the introduction of the cross-composition framework, it has found numerous
applications by various sets of authors [7, 8, 11, 15, 16, 17, 31, 36, 38]. Although all applications of
the cross-composition technique presented here are for problems under structural
parameterizations, the framework can also be used to obtain lower bounds for natural parameterizations. For
example, Cygan et al. [16] employed cross-composition to obtain kernelization lower bounds for
Edge Clique Cover, settling an open problem in kernelization, and also for various graph cut
problems parameterized by the size of the cutset. The mentioned follow-up work together with
other results on lower bounds for kernelization [21, 39] also suggests several extensions,
most of which are reviewed and included in this work. For one, we include the variant of an
AND-cross-composition as used first by Cygan et al. [16]. Furthermore, we explicitly define a
modification of OR-cross-compositions that allows proofs of polynomial lower bounds for
kernelization in the style of Dell and van Melkebeek [21]. Regarding co-nondeterministic variants
of OR-cross-compositions we refer the reader to the full version of recent work by Kratsch [39].^3

Organization. We start by giving formal definitions of all concepts relevant for our
discussion of kernelization lower bounds in Section 2, thereby providing a brief overview of the
existing techniques. In Section 3 we present the cross-composition framework. Our treatment of
the subject incorporates several small improvements that have been found after an extended
abstract of this work was published [6]. After giving the definitions of the various notions of
cross-composition, we show how they imply kernel lower bounds. The remainder of the paper is
concerned with applications of the technique to specific parameterized problems. As these are
all graph problems, we give some graph-theoretic preliminaries in Section 4.1 before presenting
our lower bounds.

2 A review of kernelization lower bound techniques 

Before introducing the framework of cross-composition, we give formal definitions of the various
concepts relating to kernelization. Let us fix some notation. Throughout this work we use $\Sigma$
to denote finite alphabets; note that $\Sigma$ may refer to a different alphabet for each problem.
We denote classical problems as $L \subseteq \Sigma^*$ and parameterized problems as
$Q \subseteq \Sigma^* \times \mathbb{N}$. Since tuples $(x, k)$ can straightforwardly be encoded as strings
over $\Sigma$ (when $|\Sigma| \ge 2$) we may also say that a parameterized problem is NP-hard or
NP-complete; in all cases the parameter value $k$ of $(x, k)$ is encoded in unary, for a total size
of $O(|x| + k)$. We call a string $x \in \Sigma^*$ an instance for $L \subseteq \Sigma^*$; ditto for
$(x, k) \in \Sigma^* \times \mathbb{N}$ and $Q \subseteq \Sigma^* \times \mathbb{N}$. We say that the instance
$x \in \Sigma^*$ is YES for $L \subseteq \Sigma^*$ if $x \in L$, and NO for $L$ otherwise; the same holds
for $(x, k)$ and $Q$. For positive integers $n$ we define $[n] := \{1, \dots, n\}$.

2.1 Kernelizations and Compressions 

We begin with a formal definition of kernelization and generalized kernelization (also called
bi-kernel [2]). Subsequently we also define a natural notion of compressing instances of
parameterized problems into short strings over some alphabet, i.e., compressing to some language
$L \subseteq \Sigma^*$, which relaxes the notion of kernelization. The known kernelization lower bound
techniques, namely via compositions, cross-compositions, and the complementary witness lemma,
can all be easily seen to apply also to such compressions. In the following we will state the
known tools in this more general way.

Definition 1 (kernelization, generalized kernelization). Let $Q, Q' \subseteq \Sigma^* \times \mathbb{N}$ be
parameterized problems and let $h\colon \mathbb{N} \to \mathbb{N}$ be a computable function. A generalized
kernelization for $Q$ into $Q'$ of size $h(k)$ is an algorithm that on input
$(x, k) \in \Sigma^* \times \mathbb{N}$ takes time polynomial in $|x| + k$ and outputs an instance $(x',k')$
such that:

"PB": The size of $x'$ and the parameter value $k'$ are bounded by $h(k)$.

"EQ": The instance $(x', k')$ is YES for $Q'$ if and only if $(x, k)$ is YES for $Q$.

The algorithm is a kernelization, or in short a kernel, for $Q$ if $Q' = Q$. It is a polynomial
(generalized) kernelization if $h(k)$ is a polynomial.
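As a concrete instance of Definition 1, the following minimal sketch implements the classical high-degree rule for Vertex Cover (usually attributed to Buss). It is a textbook kernel with $O(k^2)$ edges and is not the $2k$-vertex kernel mentioned in the introduction; the function name `buss_kernel`, the edge-list input format, and the constant `TRIVIAL_NO` are assumptions made for this illustration only.

```python
from typing import Dict, FrozenSet, Iterable, Set, Tuple

Edge = FrozenSet[int]

# A fixed NO-instance (hypothetical helper): one edge cannot be covered with budget 0.
TRIVIAL_NO: Tuple[Set[Edge], int] = ({frozenset({0, 1})}, 0)


def buss_kernel(edges: Iterable[Tuple[int, int]], k: int) -> Tuple[Set[Edge], int]:
    """Map a Vertex Cover instance (edges, k) to an equivalent instance
    with at most max(1, k*k) edges and parameter at most k."""
    remaining: Set[Edge] = {frozenset(e) for e in edges if e[0] != e[1]}
    while k >= 0:
        # Recompute degrees in the current graph.
        degree: Dict[int, int] = {}
        for e in remaining:
            for v in e:
                degree[v] = degree.get(v, 0) + 1
        # A vertex of degree > k belongs to every vertex cover of size <= k.
        high = next((v for v, d in degree.items() if d > k), None)
        if high is None:
            break
        remaining = {e for e in remaining if high not in e}
        k -= 1
    # With maximum degree <= k, any k vertices cover at most k*k edges.
    if k < 0 or len(remaining) > k * k:
        return TRIVIAL_NO
    return remaining, k
```

Property "PB" holds with $h(k) \in O(k^2)$ because at most $k^2$ edges (and hence at most $2k^2$ non-isolated vertices) survive, and property "EQ" holds because each step preserves the answer.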

^3 This is not yet available but will soon appear in Transactions on Algorithms.
Definition 2 (compression). Let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized problem, let
$L \subseteq \Sigma^*$ be a language, and let $h\colon \mathbb{N} \to \mathbb{N}$ be a computable function. A
compression for $Q$ into $L$ of size $h(k)$ is an algorithm that on input
$(x,k) \in \Sigma^* \times \mathbb{N}$ takes time polynomial in $|x| + k$ and outputs a string
$y \in \Sigma^*$ such that:

"PB": The length of $y$ is bounded by $h(k)$.

"EQ": The string $y$ is YES for $L$ if and only if $(x, k)$ is YES for $Q$.

The algorithm is a polynomial compression if $h(k)$ is a polynomial.

The following proposition explicitly states that compressions are a relaxation of generalized 
kernelizations, which in turn relax the notion of kernelization. 

Proposition 1. Let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized problem. If $Q$ has a
kernelization with size $h\colon \mathbb{N} \to \mathbb{N}$ then $Q$ also has a generalized kernelization
with size $h$. Similarly, if $Q$ has a generalized kernelization with size $h$ then $Q$ has a
compression with size $O(h)$.

Remark 1. The proposition follows immediately from Definitions 1 and 2. Let us clarify,
however, that a compression can be derived from a (generalized) kernelization with kernel size $h$
by mapping each output instance $(x',k') \in \Sigma^* \times \mathbb{N}$ to an encoding as a string
$z \in \Sigma^*$ (assume $|\Sigma| \ge 2$, otherwise use $\Sigma' := \{0,1\}$). With a unary encoding
for $k'$ this gives a bound of $O(h(k))$ on the length of $z$ since both $|x'|$ and $k'$ are bounded
by $h(k)$, where $k$ is the input parameter.
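The encoding step of Remark 1 can be sketched as follows. This is one possible self-delimiting encoding over the binary alphabet, chosen for illustration; the function names are hypothetical and the instance part `x` is assumed to be a string over `{0, 1}`.

```python
from typing import Tuple


def encode_instance(x: str, k: int) -> str:
    """Encode (x, k) as one binary string: k in unary, a '0' delimiter, then x."""
    return "1" * k + "0" + x


def decode_instance(z: str) -> Tuple[str, int]:
    """Invert encode_instance: the first '0' ends the unary parameter."""
    k = z.index("0")
    return z[k + 1:], k
```

The encoded length is $|x| + k + 1$, so if both $|x'|$ and $k'$ are at most $h(k)$ the string has length at most $2h(k) + 1 \in O(h(k))$.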

2.2 Compositions and Distillations 

We now turn our attention to the main components of the original kernelization lower bound
framework of Bodlaender et al. [4]. We will first recall the notions of OR/AND-composition and
weak OR/AND-distillation. Both notions formalize ways of encoding the logical OR, respectively
AND, of a large number of instances into a single instance that is small according to some measure.
Compositions address parameterized problems and require the output parameter to be small.
Distillations address classical problems, or languages, and demand a small overall output size.
Although we do not use the compositions of Bodlaender et al. [4] in this work, we supply the
definitions to highlight the differences with cross-composition.

Definition 3 (OR/AND-composition [4]). Let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized
problem. An OR-composition for $Q$ is an algorithm that, given $t$ instances
$(x_1, k), \dots, (x_t, k) \in \Sigma^* \times \mathbb{N}$ of $Q$, takes time polynomial in
$\sum_{i=1}^{t} |x_i| + k$ and outputs an instance $(y, k') \in \Sigma^* \times \mathbb{N}$ such that:

"PB": The parameter value $k'$ is polynomially bounded in $k$.

"OR": The instance $(y,k')$ is YES for $Q$ if and only if at least one instance $(x_i,k)$
is YES for $Q$.

An AND-composition is an algorithm that, instead, fulfills Properties "PB" and "AND".

"AND": The instance $(y,k')$ is YES for $Q$ if and only if all instances $(x_i,k)$ are
YES for $Q$.

A parameterized problem $Q$ for which such an algorithm exists is called OR-compositional,
respectively AND-compositional.
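The standard example of Definition 3 is $k$-Path: the disjoint union of the input graphs contains a simple path on $k$ vertices if and only if one of the inputs does, and the parameter is unchanged. The sketch below illustrates this under an assumed representation (a graph is a pair of a vertex count `n` and an edge list over vertices `0..n-1`); the brute-force checker `has_k_path` is exponential and is included only to sanity-check tiny examples.

```python
from typing import Dict, List, Sequence, Set, Tuple

Graph = Tuple[int, List[Tuple[int, int]]]  # (number of vertices, edge list)


def or_compose_k_path(instances: Sequence[Graph], k: int) -> Tuple[Graph, int]:
    """OR-composition for k-Path: take the disjoint union, keep k' = k."""
    offset = 0
    edges: List[Tuple[int, int]] = []
    for n, es in instances:
        # Shift vertex labels so the copies do not overlap.
        edges.extend((u + offset, v + offset) for u, v in es)
        offset += n
    return (offset, edges), k


def has_k_path(graph: Graph, k: int) -> bool:
    """Brute force: is there a simple path on k >= 1 vertices?"""
    n, edges = graph
    adj: Dict[int, Set[int]] = {v: set() for v in range(n)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)

    def extend(v: int, visited: Set[int]) -> bool:
        if len(visited) == k:
            return True
        return any(extend(w, visited | {w}) for w in adj[v] if w not in visited)

    return any(extend(v, {v}) for v in range(n))
```

Property "PB" is immediate since $k' = k$, and property "OR" holds because a simple path lies entirely within one connected component of the union.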

Definition 4 (weak OR/AND-distillation [4, 29]). Let $L, L' \subseteq \Sigma^*$ be languages. A weak
OR-distillation of $L$ into $L'$ is an algorithm that, given $t$ instances
$x_1, x_2, \dots, x_t \in \Sigma^*$ of $L$, takes time polynomial in $\sum_{i=1}^{t} |x_i|$ and outputs
a string $y \in \Sigma^*$ such that:

"PB": The length of $y$ is polynomially bounded in $\max_{i=1}^{t} |x_i|$.

"OR": The instance $y$ is YES for $L'$ if and only if at least one instance $x_i$ is YES
for $L$.

A weak AND-distillation of $L$ into $L'$ is an algorithm that, instead, fulfills Properties "PB" and
"AND".

"AND": The instance $y$ is YES for $L'$ if and only if all instances $x_i$ are YES for $L$.

The crux of the framework is that the combination of a polynomial kernelization and a 
composition, for some parameterized problem Q, yields a distillation for Q (when interpreted 
as a classical problem). In the following we denote the satisfiability problem for boolean formulae 
as SAT. 

Theorem 1 ([4]). Let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized problem that is NP-hard
when the parameter value is encoded in unary, i.e., when instances $(x,k)$ have size $O(|x| + k)$.
If $Q$ has a polynomial kernelization (or polynomial compression) and an OR-composition, then SAT
has a weak OR-distillation into some language $L \subseteq \Sigma^*$. If $Q$ instead has an
AND-composition, then SAT has an AND-distillation into $L$.

Remark 2. The reason for requiring NP-hardness with parameter value encoded in unary is
that this implies a Karp reduction from instances $\phi$ for SAT to instances $(x, k)$ for $Q$ with $k$
polynomially bounded in $|\phi|$.

Bodlaender et al. [4] conjectured that no NP-hard problem has a distillation.

Conjecture 1 (OR-distillation conjecture [4]). No NP-hard problem $L$ has a weak
OR-distillation into any language $L'$.

Conjecture 2 (AND-distillation conjecture [4]). No NP-hard problem $L$ has a weak
AND-distillation into any language $L'$.

The OR-distillation conjecture was proved shortly after by Fortnow and Santhanam [29],
under the assumption that NP $\not\subseteq$ coNP/poly. Drucker [24] recently announced a proof of the
AND-distillation conjecture based on the same assumption, but using much more complicated
machinery.

Theorem 2 ([29]). If there is a weak OR-distillation of SAT into any language $L \subseteq \Sigma^*$ then
NP $\subseteq$ coNP/poly and the polynomial-time hierarchy collapses to its third level (PH = $\Sigma_3^p$).

2.3 Transformations and Oracle Communication Protocols 

The result of Fortnow and Santhanam [29] was generalized by Dell and van Melkebeek [21],
who abstract away from the perspective of kernelization and distillation by considering an
appropriate oracle communication protocol. They obtain more fine-grained results by taking
into account not just the number of instances that are combined but also their maximum size.
This way they prove a more general form of Theorem 2, namely their complementary witness
lemma (see below), and also various polynomial lower bounds for kernelization (and PCPs); e.g.,
they show that $d$-Hitting Set (i.e., Vertex Cover on hypergraphs with edge size at most $d$)
admits no polynomial kernelization with size $O(k^{d-\epsilon})$, for any $\epsilon > 0$, unless
NP $\subseteq$ coNP/poly.
Definition 5 (oracle communication protocol [21]). An oracle communication protocol for a
language $L$ is a communication protocol for two players. The first player is given the input $x$ and
has to run in time polynomial in the length of the input; the second player is computationally
unbounded but is not given any part of $x$. At the end of the protocol the first player should be
able to decide whether $x \in L$. The cost of the protocol is the number of bits of communication
from the first player to the second player.

Definition 6 (OR/AND of a language [21]). Let $L \subseteq \Sigma^*$ be a language. The OR of the
language $L$, denoted by $\mathrm{OR}(L)$, is the set of all tuples $(x_1, \dots, x_t)$ for which
$x_i \in L$ for at least one $i \in [t]$. The AND of the language $L$, denoted by $\mathrm{AND}(L)$, is
the set of all tuples $(x_1, \dots, x_t)$ for which $x_i \in L$ for all $i \in [t]$.
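Definition 6 translates directly into code once a membership test for $L$ is available. The sketch below assumes such a test is passed in as a callable `decide` (a hypothetical parameter; for an NP-hard $L$ no efficient test is expected to exist, so this only illustrates the set definitions).

```python
from typing import Callable, Sequence


def in_or(decide: Callable[[str], bool], xs: Sequence[str]) -> bool:
    """Membership of the tuple xs in OR(L): some component lies in L."""
    return any(decide(x) for x in xs)


def in_and(decide: Callable[[str], bool], xs: Sequence[str]) -> bool:
    """Membership of the tuple xs in AND(L): every component lies in L."""
    return all(decide(x) for x in xs)
```

For instance, with the toy language of binary strings containing an even number of ones, the tuple `("1", "11", "111")` belongs to the OR of the language but not to its AND.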

Lemma 1 (complementary witness lemma [21]). Let $L$ be a language and
$t\colon \mathbb{N} \to \mathbb{N} \setminus \{0\}$ be polynomially bounded such that the problem of deciding
whether tuples $(x_1, \dots, x_{t(s)})$ of strings, each of length at most $s$, belong to
$\mathrm{OR}(L)$ has an oracle communication protocol of cost $O(t(s) \log t(s))$, where the first
player can be co-nondeterministic. Then $L \in$ coNP/poly.

Finally, we turn our attention to a different way of proving lower bounds for kernelization,
namely the use of appropriate reductions. The following type of parameterized reduction,
originally called polynomial time and parameter transformation, was first formalized by Bodlaender
et al. [10]. It is not hard to see that such reductions are strongly related to the existence of
polynomial kernelizations (or compressions).

Similarly, one may define parameterized reductions with other bounds on the new parameter
value, but they are not required in this work. Reductions with a linear dependence have been
used to transfer polynomial bounds for kernelization [25] and in the context of lower bounds on
the runtime of FPT-algorithms [43].

Definition 7 (polynomial parameter transformation [10]). Let $Q, Q' \subseteq \Sigma^* \times \mathbb{N}$ be
parameterized problems. A polynomial parameter transformation from $Q$ to $Q'$ is an algorithm that
on input $(x, k) \in \Sigma^* \times \mathbb{N}$ takes time polynomial in $|x| + k$ and outputs an instance
$(x', k') \in \Sigma^* \times \mathbb{N}$ such that:

"PB": The parameter value $k'$ is polynomially bounded in $k$.

"EQ": The instance $(x', k')$ is YES for $Q'$ if and only if $(x, k)$ is YES for $Q$.

We denote the existence of such a transformation by $Q \le_{ppt} Q'$.

Theorem 3 ([10]). Let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized problem that is NP-hard
with parameter encoded in unary, and let $Q' \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized problem
that is contained in NP. If $Q \le_{ppt} Q'$ and $Q'$ admits a polynomial kernelization then $Q$ admits
a polynomial kernelization.

To avoid the complication of having to insist on the unary encoding of the parameter value
we suggest the following proposition, which applies to the existence of polynomial compressions
(and generalized kernelizations). Since lower bounds for kernelization for some parameterized
problem $Q \subseteq \Sigma^* \times \mathbb{N}$ via the tools in this section exclude also polynomial
compressions, it seems much easier to only require $Q \le_{ppt} Q'$ to transfer the lower bound to any
other parameterized problem $Q'$.

Proposition 2. Let $Q$ and $Q'$ be parameterized problems with $Q \le_{ppt} Q'$. If $Q'$ admits a
polynomial compression then $Q$ admits a polynomial compression. If, possibly under some
complexity-theoretic assumption, $Q$ admits no polynomial compression then $Q'$ admits no
polynomial compression under the same assumption. The same is true regarding generalized
polynomial kernelizations.
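The first claim of Proposition 2 comes down to chaining the transformation with the compression and composing the two bounds. A short sketch of the calculation, where $p$ denotes the polynomial bounding the new parameter and $h$ the size bound of the compression for $Q'$ into a language $L$ (these names are introduced here for illustration, and $h$ is assumed to be non-decreasing):

```latex
\[
  (x,k) \;\mapsto\; (x',k') \;\mapsto\; y,
  \qquad k' \le p(k),
  \qquad |y| \;\le\; h(k') \;\le\; h(p(k)).
\]
```

Since $h \circ p$ is again a polynomial and $y \in L$ if and only if $(x',k') \in Q'$ if and only if $(x,k) \in Q$, the chained algorithm is a polynomial compression for $Q$; it runs in polynomial time because $|x'| + k'$ is polynomial in $|x| + k$. The second claim is the contrapositive of the first.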
3 Cross-composition



3.1 The Basic Framework 

The two main concepts of the framework are defined as follows. 

Definition 8 (polynomial equivalence relation). An equivalence relation $\mathcal{R}$ on $\Sigma^*$ is
called a polynomial equivalence relation if the following two conditions hold:

1. There is an algorithm that given two strings $x, y \in \Sigma^*$ decides whether $x$ and $y$ belong to
the same equivalence class in time polynomial in $|x| + |y|$.

2. For any finite set $S \subseteq \Sigma^*$ the equivalence relation $\mathcal{R}$ partitions the elements
of $S$ into a number of classes that is polynomially bounded in the size of the largest element of $S$.

Remark 3. The intended use of polynomial equivalence relations is to group inputs for a
(cross-)composition such that the composition will only be applied to groups of instances that
are somewhat similar, e.g., they ask for the same solution size or the considered graphs have
the same numbers of vertices and edges. We point out that $\mathcal{R} = \Sigma^* \times \Sigma^*$ trivially
fulfills the definition of a polynomial equivalence relation (all strings are equivalent and any
set $S$ is "partitioned" into a single class). Thus the use of polynomial equivalence relations in
cross-compositions according to the following definition is optional.
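To make the grouping in Remark 3 concrete, here is a minimal sketch for graph instances, assuming each instance is already parsed into a triple `(n, edges, k)` of vertex count, edge collection, and target value (a hypothetical representation chosen for this example). Two instances are equivalent when they agree on the number of vertices, the number of edges, and the target value.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Instance = Tuple[int, Tuple[Tuple[int, int], ...], int]  # (n, edges, k)


def signature(inst: Instance) -> Tuple[int, int, int]:
    """Equivalence class label: (number of vertices, number of edges, target k)."""
    n, edges, k = inst
    return n, len(edges), k


def partition_by_signature(instances: Sequence[Instance]) -> List[List[Instance]]:
    """Group instances into the equivalence classes induced by signature()."""
    classes: Dict[Tuple[int, int, int], List[Instance]] = defaultdict(list)
    for inst in instances:
        classes[signature(inst)].append(inst)
    return list(classes.values())
```

If every instance has encoding size at most $s$, and the target value is at most the number of vertices (larger values can usually be handled separately), then each of the three components takes at most $s + 1$ values, so at most $(s+1)^3$ classes arise, as condition 2 of Definition 8 requires. On raw strings, malformed inputs would additionally be placed in one extra class.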

Definition 9 (AND/OR-cross-composition). Let $L \subseteq \Sigma^*$ be a language, let $\mathcal{R}$ be a
polynomial equivalence relation on $\Sigma^*$, and let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a
parameterized problem. An OR-cross-composition of $L$ into $Q$ (with respect to $\mathcal{R}$) is an
algorithm that, given $t$ instances $x_1, x_2, \dots, x_t \in \Sigma^*$ of $L$ belonging to the same
equivalence class of $\mathcal{R}$, takes time polynomial in $\sum_{i=1}^{t} |x_i|$ and outputs
an instance $(y, k) \in \Sigma^* \times \mathbb{N}$ such that:

"PB": The parameter value $k$ is polynomially bounded in $\max_{i=1}^{t} |x_i| + \log t$.

"OR": The instance $(y, k)$ is YES for $Q$ if and only if at least one instance $x_i$ is YES
for $L$.

An AND-cross-composition of $L$ into $Q$ (with respect to $\mathcal{R}$) is an algorithm that, instead,
fulfills Properties "PB" and "AND".

"AND": The instance $(y,k)$ is YES for $Q$ if and only if all instances $x_i$ are YES
for $L$.

We say that $L$ OR-cross-composes, respectively AND-cross-composes, into $Q$ if a cross-composition
algorithm of the relevant type exists for a suitable relation $\mathcal{R}$.

For historic reasons we reserve cross-composition (without a prefix) to refer to the OR
variant. As with plain AND/OR-compositions, an AND/OR-cross-composition can be combined
with a polynomial compression to give a weak AND/OR-distillation; we prove this explicitly for
the case of AND-cross-compositions.

Theorem 4. Let $L \subseteq \Sigma^*$ be a language and let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a
parameterized problem. If $L$ AND-cross-composes into $Q$ and $Q$ has a polynomial compression (into
an arbitrary language $L'$) then $L$ has a weak AND-distillation (into $\mathrm{AND}(L')$).

Proof. To prove the theorem we construct the claimed weak distillation by generalizing the
approach of Bodlaender et al. Let $C$ denote an AND-cross-composition algorithm from $L$
into $Q$ and let $\mathcal{R}$ denote a suitable polynomial equivalence relation. Let $K$ denote a
polynomial compression algorithm for $Q$ into $L'$. The input to the distillation is a sequence
$(x_1, \dots, x_t)$ of instances of $L \subseteq \Sigma^*$. Define $s := \max_{i=1}^{t} |x_i|$. If
$t > (|\Sigma| + 1)^s$ then there must be duplicate inputs, since the number of distinct inputs of
length $s' \le s$ is $|\Sigma|^{s'}$. By discarding duplicates we may therefore assume that
$t \le (|\Sigma| + 1)^s$ and hence that $\log t \in O(s)$.
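The counting step behind this bound on $t$ can be spelled out as follows; the inequality uses only that every binomial coefficient $\binom{s}{i}$ with $0 \le i \le s$ is at least one.

```latex
\[
  \bigl|\{x \in \Sigma^* : |x| \le s\}\bigr|
  \;=\; \sum_{i=0}^{s} |\Sigma|^{i}
  \;\le\; \sum_{i=0}^{s} \binom{s}{i} |\Sigma|^{i}
  \;=\; (|\Sigma|+1)^{s},
  \qquad\text{so}\qquad
  \log t \;\le\; s \cdot \log(|\Sigma|+1) \;\in\; O(s).
\]
```

The last step treats $|\Sigma|$ as a constant, which is justified because the alphabet is fixed for the language $L$.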

The algorithm first partitions the input instances into disjoint nonempty sets $X_1, \dots, X_r$
according to equivalence under $\mathcal{R}$. By the definition of $\mathcal{R}$ it is clear that this can
be done in time polynomial in the total input size $\sum_{i=1}^{t} |x_i|$. Next, the
AND-cross-composition algorithm $C$ is applied to each set $X_i$, $1 \le i \le r$, and produces one
instance $(y_i, k_i) \in \Sigma^* \times \mathbb{N}$ from each set $X_i$.
From the definition of an AND-cross-composition and using $\log t \in O(s)$ it follows that each $k_i$
is polynomially bounded in $s$, and that the computation of these parameterized instances takes
time polynomial in the total input size. It also follows that $(y_i, k_i)$ is a YES-instance of $Q$ if and
only if all of the instances in $X_i$ are YES for $L$.

Now, we apply the polynomial compression $K$ to each instance $(y_i, k_i)$ to obtain an equivalent
instance $z_i$ of $L'$ for each $1 \le i \le r$. Since $K$ is a polynomial compression we know that these
transformations can be carried out in polynomial time and that $|z_i|$ is polynomially bounded
in $k_i$. Since $k_i$ is polynomial in $s$ it follows that $|z_i|$ is also polynomial in $s$ for
$1 \le i \le r$. Finally, the algorithm simply combines all strings $z_i$ into one tuple
$z^* := (z_1, \dots, z_r)$, which is an instance of $\mathrm{AND}(L')$. Since the size of each component is
polynomial in $s$, and since the number of components $r$ is polynomial in $s$, we have that $|z^*|$
is polynomial in $s$. By Definition 6 we know that $z^* \in \mathrm{AND}(L')$ if and only if all elements
of the tuple are contained in $L'$. By tracing back the series of equivalences we therefore find that
$z^* \in \mathrm{AND}(L')$ if and only if all inputs $x_i$ are YES for $L$. Since we can construct $z^*$ in
polynomial time and $|z^*|$ is polynomial in $s$, we have constructed a weak AND-distillation of $L$
into $\mathrm{AND}(L')$. □
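The construction in the proof is a four-stage pipeline: discard duplicates, partition by the equivalence relation, cross-compose each class, and compress each composed instance. The sketch below mirrors that structure as a higher-order function. All names are hypothetical, and the three `toy_*` functions are stand-ins for a trivially decidable language (binary strings without a zero), included only so that the pipeline can be run; they are not a real cross-composition for an NP-hard problem.

```python
from typing import Callable, List, Sequence, Tuple


def and_distill(
    instances: Sequence[str],
    equivalent: Callable[[str, str], bool],
    cross_compose: Callable[[List[str]], Tuple[str, int]],
    compress: Callable[[str, int], str],
) -> Tuple[str, ...]:
    """Combine the inputs into one tuple z* = (z_1, ..., z_r), one entry per class."""
    # Step 1: discard duplicate inputs (keeps first occurrences, in order).
    unique = list(dict.fromkeys(instances))
    # Step 2: partition into equivalence classes using pairwise tests
    # against one representative per class.
    classes: List[List[str]] = []
    for x in unique:
        for cls in classes:
            if equivalent(cls[0], x):
                cls.append(x)
                break
        else:
            classes.append([x])
    # Steps 3 and 4: cross-compose each class, then compress the result.
    return tuple(compress(*cross_compose(cls)) for cls in classes)


# Toy stand-ins (assumptions for illustration only).
def toy_equivalent(x: str, y: str) -> bool:
    return len(x) == len(y)  # same length means same class


def toy_cross_compose(xs: List[str]) -> Tuple[str, int]:
    # The concatenation has no '0' exactly when no component has one.
    return "".join(xs), len(xs[0])


def toy_compress(y: str, k: int) -> str:
    return "0" if "0" in y else "1"  # the target language is {"1"}
```

With the toy stand-ins, `and_distill(["11", "11", "10", "111"], toy_equivalent, toy_cross_compose, toy_compress)` yields one output string per length class, and the resulting tuple lies in the AND of the target language exactly when every input is free of zeros.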

The analogous result holds for OR-cross-compositions and can be proven along the same 
lines. 

Theorem 5. Let $L \subseteq \Sigma^*$ be a language and let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a
parameterized problem. If $L$ OR-cross-composes into $Q$ and $Q$ has a polynomial compression (into
an arbitrary language $L'$) then $L$ has a weak OR-distillation (into $\mathrm{OR}(L')$).

The machinery developed in this section leads to two ways of giving kernelization lower
bounds, via AND-cross-compositions and OR-cross-compositions, respectively.

Corollary 1. If an NP-hard language $L$ AND-cross-composes into the parameterized problem $Q$,
then $Q$ does not admit a (generalized) polynomial kernelization or polynomial compression unless
the AND-distillation conjecture fails.

Corollary 2. If an NP-hard language $L$ OR-cross-composes into the parameterized problem $Q$,
then $Q$ does not admit a (generalized) polynomial kernelization or polynomial compression unless
the OR-distillation conjecture fails and NP $\subseteq$ coNP/poly.

The lower bounds via OR-cross-compositions are somewhat stronger due to the result of
Fortnow and Santhanam (Theorem 2), who proved the OR-distillation conjecture under the
assumption that NP $\not\subseteq$ coNP/poly. Note that a weak OR-distillation for an NP-hard language $L$
can be combined with the NP-hardness reduction from SAT to $L$, resulting in a weak
OR-distillation for SAT. Using the tight bounds on the compressibility of $\mathrm{OR}(L)$ that are
provided by the complementary witness lemma, it is possible to derive a statement that can also be
applied to give lower bounds on the degree of the polynomial for problems that do admit polynomial
kernels. We state and prove this extension in the next section as Theorem 6; it also implies the
NP $\subseteq$ coNP/poly consequence of Corollary 2 but skips the intermediate step of building a weak
OR-distillation.
3.2 Cross-compositions of bounded cost

The mentioned recent results of Dell and van Melkebeek [21], which permit polynomial lower
bounds via oracle communication protocols for the OR of languages, also permit us to create a
similar extension for OR-cross-compositions. By a similar analysis as performed by Dell and van
Melkebeek [21] (see also [35]) we can establish that appropriate cross-compositions with bounded
cost imply polynomial lower bounds, assuming NP $\not\subseteq$ coNP/poly.

Definition 10 (AND/OR-cross-composition of bounded cost). An AND/OR-cross-composition
of $L$ into $Q$ (with respect to $\mathcal{R}$) of cost $f(t)$ is an AND/OR-cross-composition algorithm as
described in Definition 9 that satisfies "CB" instead of "PB".

"CB": The parameter $k$ is bounded by $O(f(t) \cdot (\max_i |x_i|)^c)$, where $c$ is some constant
independent of $t$.

Theorem 6. Let $L \subseteq \Sigma^*$ be a language, let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized problem, and
let $d, \epsilon$ be positive reals. If $L$ has an OR-cross-composition into $Q$ with cost $f(t) = t^{1/d+o(1)}$,
where $t$ denotes the number of instances, and $Q$ has a polynomial compression into an arbitrary
language $L'$ with size bound $O(k^{d-\epsilon})$ then $L \in$ coNP/poly. If, additionally, $L$ is NP-hard then
NP $\subseteq$ coNP/poly.

Proof. Let $\mathcal{R}$ be a polynomial equivalence relation on $\Sigma^*$ which partitions any set of strings
of length at most $s$ into at most $O(s^b)$ equivalence classes. Let $f(t) = t^{1/d+o(1)}$ for some
constant $d$. Let $C$ be an OR-cross-composition from $L$ into $Q$, which maps $t$ instances of size
at most $s$ and from the same $\mathcal{R}$-equivalence class to an output instance with parameter value
bounded by $O(f(t) s^c)$. Finally, let $K$ be a polynomial compression for $Q$ into some language $L'$
that outputs a string of size bounded by $h(k) = O(k^{d-\epsilon})$.

We define a polynomially bounded function $t$ by $t(s) := s^{(b+cd)d/\epsilon}$. By Lemma 1 it suffices to
provide an oracle communication protocol for OR($L$) with cost $O(t(s) \log t(s))$ for a tuple of $t$
strings each of length at most $s$. Fixing $s$ and $t := t(s)$, let a tuple of $t$ strings each of length at
most $s$ be given to the first player, say $(x_1, \ldots, x_t)$.

Let us first describe the protocol and bound the communication cost. The first player begins
by partitioning the strings $x_i$ according to equivalence under $\mathcal{R}$, obtaining $r \le O(s^b)$ groups.
Then he applies the OR-cross-composition $C$ to each group, obtaining $r$ instances $(y_1, k_1), \ldots,$
$(y_r, k_r)$. The parameter values $k_i$ are bounded by $O(f(t) s^c)$.

Now the first player applies the assumed polynomial compression $K$ to each instance $(y_i, k_i)$,
obtaining instances $z_1, \ldots, z_r$ of the language $L'$. Then he sends the instances $z_i$ to the second
player, who answers YES if at least one of the instances sent to him is YES, and NO otherwise.
(Note that in this simple protocol the second player might as well be an $L'$-oracle, and the cost
would correspond to the total length of queries.)

Each compressed instance has size at most $h(k_i) = O((k_i)^{d-\epsilon}) = O((f(t) s^c)^{d-\epsilon})$. Thus we
can bound the cost of sending the $r$ compressed instances to the second player as follows:

$$O\big(r \cdot (f(t) s^c)^{d-\epsilon}\big) = O\big(s^b \cdot (t^{1/d+o(1)} s^c)^{d-\epsilon}\big) = O\big(s^{b+c(d-\epsilon)} \cdot t^{1-\epsilon/d+o(1)}\big) = O(t),$$

using that $r \le O(s^b)$ and the following bound for $s^{b+c(d-\epsilon)}$:

$$s^{b+c(d-\epsilon)} = s^{b+cd} \cdot s^{-c\epsilon} = t^{\epsilon/d} \cdot s^{-c\epsilon} = t^{\epsilon/d-\delta},$$

where $\delta = \frac{c\epsilon^2}{(b+cd)d} > 0$. (Note that $t^{o(1)} = O(t^{\delta})$, for any $\delta > 0$.)
Correctness. It remains to show correctness of the protocol. Assume first that at least one
input instance $x_i$ is a YES-instance (requiring the protocol to correctly answer YES). It follows
that the corresponding instance $(y_j, k_j)$ that is created by $C$ from all instances $\mathcal{R}$-equivalent
to $x_i$ must be YES for $Q$. Accordingly, the polynomial compression $K$ transforms $(y_j, k_j)$ to a
YES-instance for the language $L'$. Hence, the oracle correctly answers YES, as it is given at least
one YES-instance for $L'$.

In the remaining case all input instances $x_1, \ldots, x_t$ are NO for $L$. The OR-cross-composition $C$
will therefore create $r$ NO-instances $(y_i, k_i)$ for $Q$. These are converted to $r$ NO-instances $z_i$ for $L'$.
Hence, presented with only NO-instances for $L'$, the oracle answers NO.

Thus we get an oracle communication protocol for deciding the OR of $t$ instances of $L$ at
cost $O(t)$. By Lemma 1 this implies that $L$ is contained in coNP/poly. Hence, if $L$ is NP-hard
then NP $\subseteq$ coNP/poly. □
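The first player's side of the protocol can be sketched in a few lines. This is only an illustration under stated assumptions: the four callables `equiv_class`, `cross_compose`, `compress`, and `oracle` are hypothetical stand-ins for the relation $\mathcal{R}$, the cross-composition $C$, the compression $K$, and the second player, and the toy language at the bottom is chosen purely so the sketch can be run.

```python
from collections import defaultdict

def or_protocol(instances, equiv_class, cross_compose, compress, oracle):
    """Return (answer, cost) for the OR of the given instances.

    Hypothetical callables (not from the paper):
      equiv_class(x)    -> hashable label of the R-equivalence class of x
      cross_compose(xs) -> (y, k), an instance of Q expressing the OR of xs
      compress(y, k)    -> string z with z in L' iff (y, k) in Q
      oracle(z)         -> True iff z in L'
    """
    # Partition the inputs into equivalence classes under R.
    groups = defaultdict(list)
    for x in instances:
        groups[equiv_class(x)].append(x)
    # One composed and compressed instance per class.
    messages = [compress(*cross_compose(group)) for group in groups.values()]
    # The communication cost is the total length of what is sent.
    cost = sum(len(z) for z in messages)
    answer = any(oracle(z) for z in messages)
    return answer, cost

# Toy stand-ins: L is the set of bitstrings containing a '1'.
def toy_class(x):
    return len(x)

def toy_compose(xs):
    return ("1" if any("1" in x for x in xs) else "0", 1)

def toy_compress(y, k):
    return y

def toy_oracle(z):
    return z == "1"
```

With the toy stand-ins, three inputs of two different lengths fall into two classes, so two one-character messages are sent regardless of how many inputs share a class.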

It is an immediate consequence that cross-compositions with cost $f(t) = t^{o(1)}$ exclude any
polynomial compression. To see this, observe that their cost is bounded also by $t^{1/d+o(1)}$ for
any positive constant $d$. Thus, for any constant $d$ this excludes polynomial compressions to
size $O(k^{d-\epsilon})$ and hence any polynomial compression.

Corollary 3. Let $L \subseteq \Sigma^*$ be a language and let $Q \subseteq \Sigma^* \times \mathbb{N}$ be a parameterized problem. If $L$
has an OR-cross-composition into $Q$ with dependence $f(t) = t^{o(1)}$ on the number $t$ of instances
and $Q$ has a polynomial compression into an arbitrary language $L'$ then $L \in$ coNP/poly. If,
additionally, $L$ is NP-hard then NP $\subseteq$ coNP/poly.

4 Our lower bounds 

Cross-composition makes it possible to give kernelization lower bounds for structural parame- 
terizations of various fundamental graph problems. In the following sections we study Clique, 
Chromatic Number, Feedback Vertex Set, and Odd Cycle Transversal. We show 
that parameterizations solvable by the simplest of FPT algorithms nevertheless fail to admit 
polynomial kernelizations unless NP $\subseteq$ coNP/poly. All lower bounds we give in this section also
apply to generalized kernelizations, but for brevity we only state them for normal kernelizations. 
The cross-compositions employed in this section all take the form of OR-cross-compositions. For 
historic reasons and simplicity we shall therefore omit the adjective OR in the remainder. Before 
presenting our kernel lower bounds we give some graph-theoretic preliminaries. 

4.1 Graph preliminaries 

We will only work with undirected, finite, simple graphs. For a graph $G$ we denote its vertex
set by $V(G)$ and the edge set by $E(G)$. We use $G[V']$ to denote the subgraph of $G$ induced
by $V'$, i.e., the graph on vertex set $V'$ and edge set $\{\{u,v\} \in E(G) \mid u,v \in V'\}$. We use $G - Z$
as an abbreviation for $G[V(G) \setminus Z]$. The open neighborhood of a vertex $v$ in graph $G$ contains
the vertices adjacent to $v$ and is written as $N_G(v)$. The open neighborhood of a set $S \subseteq V(G)$
is defined as $\bigcup_{v \in S} N_G(v) \setminus S$. Identifying a set $S \subseteq V(G)$ of vertices of graph $G$ into a new
vertex $x$ results in the graph obtained from $G - S$ by adding a new vertex $x$ adjacent to the
vertices in $N_G(S)$. The edge-complement of $G$ is the graph $\overline{G}$ with edge set $\{\{u,v\} \subseteq V(G) \mid$
$u \ne v \wedge \{u,v\} \notin E(G)\}$. An odd cycle is a simple cycle on an odd number of vertices. An
$\ell$-coloring of graph $G$ is a function $f\colon V(G) \to [\ell]$, and it is a proper coloring if $\{u,v\} \in E(G)$
implies $f(u) \ne f(v)$.

The chromatic number $\chi(G)$ of the graph $G$ is defined as the smallest integer $\ell$ for which $G$
has a proper $\ell$-coloring. A vertex cover in the graph $G$ is a subset $Z \subseteq V(G)$ which intersects
all edges, therefore implying that $G - Z$ is an edgeless graph. If $Z$ intersects all cycles then
it is a feedback vertex set (fvs), and its removal $G - Z$ yields a forest. Finally, an odd cycle
transversal (oct) intersects all odd cycles, and its removal results in a bipartite graph. The
vertex cover number of $G$ is the size of a smallest vertex cover; the numbers related to the other
two types of transversals are defined analogously. The vertex-deletion distance of a graph $G$ to
a graph family $\mathcal{F}$ is the size of a smallest set $S \subseteq V(G)$ such that $G - S \in \mathcal{F}$. We use $K_t$ and $P_t$
to denote a clique or path on $t$ vertices, respectively. We will occasionally use $n$ to refer to
the number of vertices in the (input) graph under consideration. When taking the $n$-bit binary
expansion of numbers in the range 1 up to $2^n$, we map $2^n$ to the bitstring consisting of $n$ zeros.

4.2 Cliques and Vertex Covers 

In this section we prove a kernel lower bound for a strong structural parameterization of Clique, 
and we consider some of its consequences. An instance of the NP-complete Clique problem [30,
GT19] is a tuple $(G, \ell)$ and asks whether the graph $G$ contains a clique on $\ell$ vertices. This
classical problem is the source language for the cross-composition which establishes our first 
kernelization lower bound. The parameterization for which we derive a lower bound is formally 
defined as follows. 

Clique parameterized by Vertex Cover 

Instance: A graph $G$, a vertex cover $Z \subseteq V(G)$, and a positive integer $\ell$.
Parameter: The size $k := |Z|$ of the vertex cover.
Question: Does $G$ have a clique of size at least $\ell$?

Note that we supply a vertex cover in the input of the problem, to ensure that well-formed 
instances can be recognized efficiently: the parameter to the problem claims a bound on the 
vertex cover number of the graph, and using the set Z we may verify this bound in polynomial 
time. We will define the other parameterized problems for which we establish kernel lower 
bounds similarly. Observe that these versions of the problems, where a witness of the parameter 
value is supplied in the input, are certainly no harder to kernelize than the versions where 
such a witness is not given. An extensive discussion of the technicalities regarding structural 
parameterizations can be found in the recent survey by Fellows et al. [25]. 

Let us briefly consider the complexity of the chosen parameterization of the Clique problem. 
If the graph $G$ has a vertex cover $Z$ and contains a clique $C \subseteq V(G)$, then $|C \setminus Z| \le 1$: as
the definition of a vertex cover ensures there are no edges between vertices in $V(G) \setminus Z$, any
clique contains at most one vertex from $V(G) \setminus Z$. The problem defined above can therefore
be solved in $O(2^{|Z|} n^2)$ time by enumerating all subsets $C \subseteq Z$, checking whether they induce
a clique $G[C]$, and whether there is a vertex in $V(G) \setminus Z$ adjacent to all members of $C$ which
may be added to the clique. We detect a maximum clique when its intersection with $Z$ is used
as the subset $C$.
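The enumeration just described is short enough to write out. The following is a minimal sketch, assuming the graph is given as a dictionary from each vertex to the set of its neighbours; the function name and this representation are illustrative choices, not notation from the paper.

```python
from itertools import combinations

def has_clique_via_vertex_cover(adj, Z, ell):
    """Decide whether the graph has a clique of size at least ell.

    adj: dict mapping each vertex to the set of its neighbours.
    Z:   a vertex cover of the graph.
    Runs over all subsets C of Z, so the time is 2^|Z| times a polynomial.
    """
    zset = set(Z)
    outside = [v for v in adj if v not in zset]
    for size in range(len(zset) + 1):
        for C in combinations(list(zset), size):
            # Skip subsets that do not induce a clique.
            if any(v not in adj[u] for u, v in combinations(C, 2)):
                continue
            if size >= ell:
                return True
            # A clique contains at most one vertex outside the vertex cover.
            if size + 1 >= ell and any(all(c in adj[v] for c in C) for v in outside):
                return True
    return False
```

For example, on a triangle with one pendant vertex and a vertex cover of size two, the function finds the triangle by extending a two-vertex subset of the cover with one outside vertex.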

This same insight also yields a so-called cheating kernel [26j or Turing kernel jSl]. For 
an input instance $(G, Z, \ell, k)$ of Clique parameterized by Vertex Cover, create a list of
instances which contains for each $v \in V(G) \setminus Z$ the instance $(G[Z \cup \{v\}], Z, \ell, k)$, in addition
to $(G[Z], Z, \ell, k)$. The preceding discussion shows that $G$ has a clique of size $\ell$ if and only
if one of the instances in the list has such a clique. As each instance in the list contains at
most $|Z| + 1 = k + 1$ vertices, this is an interesting example of a rare phenomenon: although
we cannot, in polynomial time, reduce the size of an instance of Clique parameterized
by Vertex Cover to polynomial in the parameter, we can create $n^{O(1)}$ instances of size
polynomial in the parameter, such that the input is YES if and only if one of the outputs is.
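The list of small instances can be produced directly from the definition. The sketch below assumes the same dictionary-of-neighbour-sets representation as before; the function name is a hypothetical choice for illustration.

```python
def turing_kernel_instances(adj, Z, ell):
    """List the instances (G[Z], Z, ell, k) and (G[Z + v], Z, ell, k) for v outside Z.

    adj: dict mapping each vertex to the set of its neighbours.
    Each returned graph has at most |Z| + 1 vertices.
    """
    Z = set(Z)

    def induced(vertices):
        # Subgraph induced by the given vertex set.
        return {v: adj[v] & vertices for v in vertices}

    instances = [(induced(Z), Z, ell, len(Z))]
    for v in adj:
        if v not in Z:
            instances.append((induced(Z | {v}), Z, ell, len(Z)))
    return instances
```

The number of instances is one more than the number of vertices outside $Z$, and the original instance is YES exactly when at least one listed instance is.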
Figure 1: A sketch of the construction used in the proof of Theorem 7. The dashed edges show
in an exemplary way how vertices $w_{p,q}$, $w_{p,\hat{q}}$, and $w_{\hat{p},q}$ are connected to vertices of $B$ and $C$,
e.g., $\{p,q\}$ is an edge of $G_i$ but not of $G_j$.

Having provided all the necessary background for the following lower bound, we give its 
proof by cross-composition. 

Theorem 7. Clique parameterized by Vertex Cover does not admit a polynomial kernel
unless NP $\subseteq$ coNP/poly.

Proof. We prove the theorem by showing that Clique cross-composes into Clique parameterized
by Vertex Cover; by Corollary 2 this is sufficient to establish the claim. We define
a polynomial equivalence relation $\mathcal{R}$ such that all bitstrings that do not encode a valid instance
of Clique or that have a target clique size $\ell$ that exceeds the number of vertices are equivalent.
Of the remaining instances any two (well-formed) instances $(G_1, \ell_1)$ and $(G_2, \ell_2)$ are equivalent
if and only if they satisfy $|V(G_1)| = |V(G_2)|$ and $\ell_1 = \ell_2$. From this definition it follows that
any set of well-formed instances on at most $n$ vertices each is partitioned into $O(n^2)$ equivalence
classes. Thus $\mathcal{R}$ is a polynomial equivalence relation since the number of bits in a clique instance
on $n$ vertices is at least $n$.

We now give a cross-composition algorithm that composes $t$ input instances $x_1, \ldots, x_t$ that
are equivalent under $\mathcal{R}$ into a single instance of Clique parameterized by Vertex Cover.
If the input instances are malformed or the size of the clique that is asked for exceeds the number
of vertices in the graph, then we may output a single constant-size NO instance; hence in the
remainder we may assume that all inputs are well-formed and encode structures $(G_1, \ell), \ldots, (G_t, \ell)$
such that $|V(G_i)| = n$ for all $i \in [t]$ and all instances agree on the value of $\ell \le n$. We construct
a single instance $(G', Z', \ell', k')$ of Clique parameterized by Vertex Cover, which consists
of a graph $G'$ with vertex cover $Z' \subseteq V(G')$ of size $k'$, and an integer $\ell'$.

Let the vertices in each graph $G_i$ be numbered arbitrarily from 1 to $n$. We construct the
graph $G'$ as follows (see also Figure 1):

1. Create $\ell n$ vertices $v_{i,j}$ with $i \in [\ell]$ and $j \in [n]$. Connect two vertices $v_{i,j}$ and $v_{i',j'}$ if $i \ne i'$
and $j \ne j'$. Let $C$ denote the set of these vertices. It is crucial that any clique in $G'$ can
only contain one vertex $v_{i,\cdot}$ or $v_{\cdot,j}$ for each choice of $i \in [\ell]$ respectively $j \in [n]$. Thus any
clique contains at most $\ell$ vertices from $C$.

2. For each pair $1 \le p < q \le n$ of distinct vertices from $[n]$ (i.e., vertices of graphs $G_i$),
create three vertices: $w_{p,q}$, $w_{p,\hat{q}}$, and $w_{\hat{p},q}$ and make them adjacent to $C$ as follows:

(a) $w_{p,q}$ is adjacent to all vertices from $C$,

(b) $w_{p,\hat{q}}$ is adjacent to all vertices from $C$ except for $v_{\cdot,j}$ with $j = q$, and

(c) $w_{\hat{p},q}$ is adjacent to all vertices from $C$ except for $v_{\cdot,j}$ with $j = p$.

Furthermore we add all edges between vertices $w_{\cdot,\cdot}$ that correspond to distinct pairs
from $[n]$. Let $D$ denote these $3\binom{n}{2}$ vertices. Any clique can contain at most one $w_{\cdot,\cdot}$
vertex for each pair from $[n]$.

3. For each input instance $x_i$ with graph $G_i$, make a new vertex $u_i$ and connect it to all
vertices in $C$. The adjacency to $D$ is as follows:

(a) Make $u_i$ adjacent to $w_{p,q}$ if $\{p, q\}$ is an edge in $G_i$.

(b) Otherwise make $u_i$ adjacent to $w_{p,\hat{q}}$ and $w_{\hat{p},q}$.

Let $B$ denote this set of $t$ vertices.

We define $\ell' := \ell + 1 + \binom{n}{2}$. Furthermore, we let $Z' := C \cup D$, which is easily verified to be a
vertex cover for $G'$ of size $k' := |Z'| = \ell n + 3\binom{n}{2}$. The value $k'$ is the parameter to the problem;
it is polynomial in $n$ and hence in the size of the largest input instance. The cross-composition
outputs the instance $x' := (G', Z', \ell', k')$. It is easy to see that our construction of $G'$ can be
performed in polynomial time. Let us now argue that $x'$ is YES for Clique parameterized
by Vertex Cover if and only if at least one of the instances $x_i$ is YES for Clique.
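Steps 1 to 3 of the construction can be sketched as follows. This is an illustration under assumptions: each input graph is passed as a set of `frozenset({p, q})` edges on the vertices 1 to $n$, and the tuple labels for the vertices of $C$, $D$, and $B$ (including the tags `"both"`, `"not_q"`, `"not_p"` for $w_{p,q}$, $w_{p,\hat{q}}$, $w_{\hat{p},q}$) are hypothetical names chosen for the sketch.

```python
from itertools import combinations

def cross_compose_clique(edge_sets, n, ell):
    """Build (adj, Z', ell', k') from t Clique instances (G_i, ell) on vertices 1..n.

    edge_sets[i]: set of frozenset({p, q}) edges of the (i+1)-th input graph.
    """
    # Step 1: the ell * n vertices v_{i,j}.
    C = [("v", i, j) for i in range(1, ell + 1) for j in range(1, n + 1)]
    adj = {c: set() for c in C}

    def add_edge(a, b):
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)

    for a, b in combinations(C, 2):
        if a[1] != b[1] and a[2] != b[2]:
            add_edge(a, b)

    # Step 2: three vertices per pair p < q, each avoiding one column of C (or none).
    pairs = list(combinations(range(1, n + 1), 2))
    D = []
    for p, q in pairs:
        for kind, excluded in (("both", None), ("not_q", q), ("not_p", p)):
            w = ("w", p, q, kind)
            D.append(w)
            for c in C:
                if c[2] != excluded:
                    add_edge(w, c)
    for a, b in combinations(D, 2):
        if (a[1], a[2]) != (b[1], b[2]):
            add_edge(a, b)

    # Step 3: one vertex u_i per input instance.
    for i, edges in enumerate(edge_sets, start=1):
        u = ("u", i)
        for c in C:
            add_edge(u, c)
        for p, q in pairs:
            if frozenset((p, q)) in edges:
                add_edge(u, ("w", p, q, "both"))
            else:
                add_edge(u, ("w", p, q, "not_q"))
                add_edge(u, ("w", p, q, "not_p"))

    Z = set(C) | set(D)
    ell_prime = ell + 1 + n * (n - 1) // 2
    return adj, Z, ell_prime, len(Z)
```

On small inputs the output can be checked by brute force: the composed graph should contain a clique of size $\ell'$ exactly when one of the inputs contains a clique of size $\ell$.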

($\Leftarrow$) First we will assume that some $x_{i^*} = (G_{i^*}, \ell)$ is YES for Clique, i.e., that $G_{i^*}$ contains
a clique on at least $\ell$ vertices. Let $S \subseteq [n]$ denote a clique of size exactly $\ell$ in $G_{i^*}$. We will
construct a set $S'$ of size $\ell' = \ell + 1 + \binom{n}{2}$ and show that it is a clique in $G'$:

1. We add the vertex $u_{i^*}$ to $S'$.

2. Let $S = \{p_1, \ldots, p_\ell\} \subseteq [n]$. For each $p_j$ in $S$ we add the vertex $v_{j,p_j}$ to $S'$. By Step 1 all
these vertices are pairwise adjacent, and by Step 3 they are adjacent to $u_{i^*}$.

3. For each pair $1 \le p < q \le n$ there are two cases:

(a) If $\{p, q\}$ is an edge of $G_{i^*}$ then the vertex $u_{i^*}$ is adjacent to $w_{p,q}$ in $G'$ (by Step 3)
and $w_{p,q}$ is adjacent to all vertices of $C$ (by Step 2). We add $w_{p,q}$ to $S'$.

(b) Otherwise the vertex $u_{i^*}$ is adjacent to both $w_{p,\hat{q}}$ and $w_{\hat{p},q}$. Since the clique $S$ cannot
contain both $p$ and $q$ when $\{p, q\}$ is a non-edge, we can add $w_{p,\hat{q}}$ or $w_{\hat{p},q}$ to $S'$ while
preserving the fact that $S'$ is a clique. Recall that, e.g., $w_{p,\hat{q}}$ is adjacent to all
vertices of $C$ except those corresponding to $q$.

In both cases we add one $w_{\cdot,\cdot}$-vertex to $S'$, each corresponding to a different pair $p, q$; all
these vertices are pairwise adjacent by Step 2.

We have identified the clique $S'$ in $G'$ of size $\ell' = \ell + 1 + \binom{n}{2}$, proving that $x'$ is a YES-instance.

($\Rightarrow$) Now assume that $x'$ is a YES-instance and let $S'$ be a clique of size $\ell + 1 + \binom{n}{2}$ in $G'$.
Since $S'$ contains at most $\ell$ vertices from $C$ (i.e., one $v_{i,\cdot}$ for each $i \in [\ell]$) and at most $\binom{n}{2}$ vertices
from $D$ it must contain at least one vertex from $B$, say $u_{i^*} \in B$. Since $B$ is an independent
set, the clique $S'$ must contain exactly $\ell$ vertices from $C$ and exactly $\binom{n}{2}$ vertices from $D$.
Let $S = \{j \in [n] \mid v_{i,j} \in S' \text{ for some } i \in [\ell]\}$. The set $S$ has size $\ell$ since $S'$ contains at most
one vertex $v_{\cdot,j}$ for each $j \in [n]$. We will now argue that $S$ is a clique in $G_{i^*}$. Let $p, q \in S$.
The clique $S'$ must contain a $w_{\cdot,\cdot}$-vertex corresponding to $\{p, q\}$ and it must contain vertices $v_{i,p}$
and $v_{i',q}$ for some $i, i' \in [\ell]$. Therefore it must contain $w_{p,q}$ since $w_{p,\hat{q}}$ has no edges to vertices $v_{\cdot,q}$
and $w_{\hat{p},q}$ has no edges to $v_{\cdot,p}$ by Step 2. Thus $u_{i^*} \in S'$ must be adjacent to $w_{p,q}$ which implies
that $G_{i^*}$ contains the edge $\{p, q\}$. Thus $S$ is a clique in $G_{i^*}$.

Since we proved that the instance $(G', Z', \ell', k')$ can be constructed in polynomial time and
that it acts as the OR of the input instances, and because the parameter value $k'$ is bounded
by a polynomial in the size of the largest input instance, this concludes the cross-composition
proof and establishes the claim. □

In the remainder of this work we will not be as explicit in giving the definition of the
equivalence relationship $\mathcal{R}$ as we were in the proof of Theorem 7; we shall simply state the
desired forms of the inputs when it is easy to see that such a form can be achieved by a suitable
choice of $\mathcal{R}$. We restrict ourselves to giving cross-compositions for well-formed input instances,
since we can simply output a constant-sized NO instance if the input is a sequence of malformed 
instances. The previous theorem yields an interesting corollary for the following structural 
parameterization of Vertex Cover. 

Vertex Cover parameterized by Clique Deletion Set 

Instance: A graph $G$, a set $Z \subseteq V(G)$ such that $G - Z$ is a clique, and a positive
integer $\ell$.
Parameter: The size $k := |Z|$ of the deletion set.
Question: Does $G$ have a vertex cover of size at most $\ell$?

As the vertex cover number of a clique Kt is t — 1, we can interpret this as a parameterization 
by the distance from trivially solvable instances as discussed by Niedermeier [l6]. Our results 
imply that even though the parameter will often be very large, the problem is unlikely to admit 
a polynomial kernel. 

Corollary 4. If $\mathcal{F}$ is a class of graphs containing all cliques, then Vertex Cover and
Independent Set parameterized by the minimum number of vertex deletions to obtain a graph
in $\mathcal{F}$ do not admit polynomial kernels unless NP $\subseteq$ coNP/poly. In particular, Vertex Cover
and Independent Set parameterized by clique deletion set do not admit polynomial kernels
unless NP $\subseteq$ coNP/poly. □

Proof. Consider an instance $(G, Z, \ell, k)$ of Clique parameterized by Vertex Cover. Since
a clique in $G$ is an independent set in $\overline{G}$, the Clique instance is equivalent to asking whether
the graph $\overline{G}$ has an independent set of size at least $\ell$. Because $Z$ is a vertex cover for $G$ we know
that $G - Z$ is an independent set, and therefore $\overline{G} - Z$ is a clique. Hence the instance $(G, Z, \ell, k)$
of Clique parameterized by Vertex Cover is equivalent to an instance $(\overline{G}, Z, \ell, k)$ of
Independent Set parameterized by Clique Deletion Set. Since $\overline{G}$ has an independent set
of size $\ell$ if and only if it has a vertex cover of size $|V(G)| - \ell$ it follows that these two instances
are also equivalent to the instance $(\overline{G}, Z, |V(G)| - \ell, k)$ of Vertex Cover parameterized by
Clique Deletion Set. Hence the described operations serve as polynomial parameter
transformations from Clique parameterized by Vertex Cover into the mentioned problems,
which yields the corollary by Theorem 7 and Proposition 2. □
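The transformation in this proof amounts to complementing the graph and adjusting the target value. A minimal sketch, assuming the dictionary-of-neighbour-sets representation and a hypothetical function name:

```python
def clique_vc_to_vertex_cover_cds(adj, Z, ell):
    """Map (G, Z, ell, k) for Clique by Vertex Cover to an instance of
    Vertex Cover parameterized by Clique Deletion Set on the complement graph.

    adj: dict mapping each vertex to the set of its neighbours in G.
    Returns (co_adj, Z, |V(G)| - ell, |Z|); the parameter is unchanged.
    """
    vertices = set(adj)
    # Edge-complement: u and v are adjacent iff they are distinct and non-adjacent in G.
    co_adj = {v: vertices - adj[v] - {v} for v in vertices}
    return co_adj, set(Z), len(vertices) - ell, len(Z)
```

Since the set $Z$ and hence the parameter value carry over unchanged, the map is a polynomial parameter transformation in the sense used above.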

4.3 Chromatic Number 

In this section we give a kernelization lower bound for Chromatic Number parameterized
by Vertex Cover, by cross-composing a restricted version of 3-Coloring into it. We consider
the problem on the following class of graphs.

Definition 11. A graph $G$ is a triangle split graph if $V(G)$ can be partitioned into sets $X$
and $Y$ such that $G[X]$ is an edgeless graph and $G[Y]$ is a disjoint union of triangles.
We will see that 3-Coloring instances on triangle split graphs lend themselves better to
composition than general instances: if we take the disjoint union of a set of triangle split graphs
and merge the induced triangles one by one, no adjacency information is lost. Before we give the
composition, however, we must prove that 3-Coloring is still NP-complete with this restriction
on the input graphs.

3-Coloring with Triangle Split Decomposition

Instance: A graph $G$ with a partition of its vertex set into $X \cup Y$ such that $G[X]$
is edgeless and $G[Y]$ is a union of vertex-disjoint triangles.
Question: Is there a proper 3-coloring of $G$?

We prove the NP-completeness of this problem by replacing edges in a general instance of 
3-Coloring by triangles. 

Lemma 2. 3-Coloring with Triangle Split Decomposition is NP-complete. 

Proof. It is well-known that 3-Coloring on general graphs is NP-complete [30, GT4], and it
is trivial to see that the problem restricted to triangle split graphs is contained in NP. We show
how to transform an instance $G$ of 3-Coloring in polynomial time into an equivalent instance
of 3-Coloring on a graph $G'$ with a triangle split decomposition of $V(G')$ into sets $X'$ and $Y'$.
Number the edges in $G$ as $e_1, e_2, \ldots, e_m$. Construct the graph $G'$ as follows:

• Set $V(G') := V(G) \cup \{a_i, b_i, c_i \mid i \in [m]\}$ and $E(G') := \emptyset$.

• Add the edges $\{a_i, b_i\}$, $\{b_i, c_i\}$, $\{a_i, c_i\}$ to $E(G')$ for $i \in [m]$.

• For each edge $e_i = \{u_i, v_i\}$ ($i \in [m]$) of graph $G$, make vertex $u_i$ adjacent in $G'$ to $a_i$, and
make $v_i$ adjacent to $b_i$ and $c_i$.

• Define $X' := V(G)$ and $Y' := \{a_i, b_i, c_i \mid i \in [m]\}$.

This concludes the description of $G'$. It is easy to see that $G'$ is a triangle split graph with the
partition $X'$ and $Y'$ since $X'$ is an independent set and $G'[Y']$ is a disjoint union of triangles.

We now show that $\chi(G') \le 3$ if and only if $\chi(G) \le 3$.
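The edge-replacement step can be sketched as follows. The labels `("a", i)`, `("b", i)`, `("c", i)` for the new triangle vertices are hypothetical names for this illustration, and the sketch assumes that no original vertex already uses such a label.

```python
def to_triangle_split(vertices, edges):
    """Replace each edge e_i = (u_i, v_i) of G by a triangle {a_i, b_i, c_i}.

    vertices: iterable of the vertices of G.
    edges:    list of (u, v) pairs.
    Returns (adj, X', Y') where adj maps each vertex of G' to its neighbour set.
    """
    adj = {v: set() for v in vertices}

    def add_edge(x, y):
        adj[x].add(y)
        adj[y].add(x)

    Y = []
    for i, (u, v) in enumerate(edges, start=1):
        a, b, c = ("a", i), ("b", i), ("c", i)
        for x in (a, b, c):
            adj[x] = set()
        # The new triangle.
        add_edge(a, b)
        add_edge(b, c)
        add_edge(a, c)
        # u_i sees a_i; v_i sees b_i and c_i.
        add_edge(u, a)
        add_edge(v, b)
        add_edge(v, c)
        Y += [a, b, c]
    return adj, set(vertices), set(Y)
```

The original vertices become pairwise non-adjacent in the output, and each new vertex has exactly two neighbours among the new vertices, which is the triangle split structure required by Definition 11.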

($\Rightarrow$) Assume that $\chi(G') \le 3$ and consider a 3-coloring of $G'$. For every edge $\{u_i, v_i\} \in E(G)$
we added a triangle on vertices $\{a_i, b_i, c_i\}$ to the graph $G'$. Hence $G'[N_{G'}(\{u_i, v_i\})]$ contains a
triangle for all pairs of vertices $\{u_i, v_i\}$ which are adjacent in $G$. If some pair $u_i$ and $v_i$ receive the
same color, then this leaves only two colors to use on the triangle in their open neighborhood.
As it takes three colors to properly color a triangle, any proper 3-coloring of $G'$ uses different
colors for $u_i$ and $v_i$, for all $\{u_i, v_i\} \in E(G)$. Therefore the 3-coloring of $G'$ restricted to the
vertex set of $G$ is a proper 3-coloring of $G$.

($\Leftarrow$) Assume that $G$ has a proper 3-coloring. We construct a 3-coloring for $G'$ by coloring
all vertices of $V(G') \cap V(G)$ the same as in $G$; now all that remains is to color the triangles we
added to the graph. If there is a triangle $\{a_i, b_i, c_i\}$ for a pair $\{u_i, v_i\}$ then $\{u_i, v_i\}$ are adjacent
in $G$ and hence they receive different colors in the proper coloring. Now give $a_i$ the color of $v_i$,
give $b_i$ the color of $u_i$ and give $c_i$ the remaining color. If we do this for every triangle then we
obtain a proper 3-coloring of $G'$ which proves that $\chi(G') \le 3$.

Since the instance $(G', X', Y')$ can be built from $G$ in polynomial time this proves that
3-Coloring with Triangle Split Decomposition is NP-complete. □

Equipped with this lemma we can prove a kernel lower bound for the following problem. 
Chromatic Number parameterized by Vertex Cover
Instance: A graph $G$, a vertex cover $Z \subseteq V(G)$, and a positive integer $\ell$.
Parameter: The size $k := |Z|$ of the vertex cover.
Question: Is there a proper $\ell$-coloring of $G$?

The parameterization is easily seen to be fixed-parameter tractable by noting that the
chromatic number of a graph exceeds its vertex cover number by at most one. Hence $\ell \le |Z| + 1$ for
all relevant inputs, which yields an FPT algorithm by trying all $\ell$-colorings of $G[Z]$ and testing
whether they can be extended to the entire graph. This test comes down to checking whether
for each vertex $v \in V(G) \setminus Z$ there is a color which is not yet used on a neighbor in $Z$, resulting
in a total runtime of $O(\ell^{|Z|} n^{O(1)}) = O((k+1)^k n^{O(1)})$.
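A direct rendering of this algorithm, as a sketch only: the graph is assumed to be a dictionary from vertices to neighbour sets, the function name is hypothetical, and inputs with $\ell \ge |Z| + 1$ are answered YES immediately because such a coloring always exists.

```python
from itertools import product

def colorable_via_vertex_cover(adj, Z, ell):
    """Decide whether the graph has a proper ell-coloring, given a vertex cover Z.

    adj: dict mapping each vertex to the set of its neighbours.
    Tries all ell^|Z| colorings of G[Z] and checks whether each extends.
    """
    Z = list(Z)
    zset = set(Z)
    if ell >= len(Z) + 1:
        return True  # the chromatic number is at most |Z| + 1
    for assignment in product(range(ell), repeat=len(Z)):
        color = dict(zip(Z, assignment))
        # The coloring must be proper on G[Z].
        if any(color[u] == color[v] for u in Z for v in adj[u] if v in zset):
            continue
        # Every vertex outside Z has all its neighbours in Z, so it can be
        # colored iff its neighbours leave at least one color unused.
        if all(len({color[w] for w in adj[v]}) < ell for v in adj if v not in zset):
            return True
    return False
```

Because the vertices outside $Z$ are pairwise non-adjacent, their colors can be chosen independently, which is why the extension test looks at one vertex at a time.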

Theorem 8. Chromatic Number parameterized by Vertex Cover does not admit a
polynomial kernel unless NP $\subseteq$ coNP/poly.

Proof. To prove the theorem we will show that 3-Coloring with Triangle Split Decomposition
cross-composes into Chromatic Number parameterized by Vertex Cover. By a
suitable choice of polynomial equivalence relation, in the same style as in Theorem 7, we may
assume that we are given $t$ input instances that encode structures $(G_1, X_1, Y_1), \ldots, (G_t, X_t, Y_t)$
of 3-Coloring with Triangle Split Decomposition with $|X_i| = n$ and $|Y_i| = 3m$ for
all $i \in [t]$ (i.e., $m$ is the number of triangles in each instance). We will compose these instances
into one instance $(G', Z', \ell', k')$ of Chromatic Number parameterized by Vertex Cover.
By duplicating some instances we may assume that the number of inputs $t$ is a power of 2; this
does not affect the truth value of the OR of the inputs, and a parameter value which is suitably
bounded with respect to the longer sequence of inputs is also suitably bounded with respect to
the original list of inputs.

For each set $Y_i$, label the triangles in $G_i[Y_i]$ as $T_1, \ldots, T_m$ in some arbitrary way, and label
the vertices in each triangle $T_j$ for a set $Y_i$ as $a^i_j, b^i_j, c^i_j$. We build a graph $G'$ with a vertex cover
of size $k' := 3\log t + 4 + 3m \in O(m + \log t)$ such that $G'$ can be $\ell' := \log t + 4$-colored if and
only if one of the input instances can be 3-colored.

1. Initialize the graph $G'$ as the disjoint union of the input graphs $G_1, \ldots, G_t$.

2. For each $j \in [m]$, identify the vertices $\{a^i_j \mid i \in [t]\}$ into a new vertex $a_j$. Similarly
identify all $b^i_j$ into $b_j$, and all $c^i_j$ into $c_j$, for each $j \in [m]$. The vertices resulting from
these identification operations are the triangle vertices $T'$. As we are effectively merging
triangles one by one, $G'[T']$ is a disjoint union of triangles.

3. Add a clique on vertices $\{p_i \mid i \in [\log t]\} \cup \{w, x, y, z\}$ to $G'$; it is called the palette.

4. Make all vertices in $T'$ adjacent to all vertices from the palette except $x$, $y$, and $z$.

5. Make all the vertices in $\bigcup_{i=1}^{t} X_i$ adjacent to $w$.

6. For $i \in [\log t]$ add a path on two new vertices $\{q^i_0, q^i_1\}$ to the graph, and make them
adjacent to all vertices of the palette except $p_i$ and $w$. These vertices form the instance
selector vertices.

7. For each instance number $i \in [t]$ we can write a binary representation of the value $i$ in $\log t$
bits. Consider each position $j \in [\log t]$ of this binary representation, where position 1 is
most significant and $\log t$ is least significant. If bit number $j$ of the representation of $i$ is
a 0 (resp. a 1) then make vertex $q^j_0$ (resp. $q^j_1$) adjacent to all vertices of $X_i$.
This concludes the construction. The following claims about $G'$ are easy to verify:
(I) In every proper $\ell'$-coloring of $G'$, where $\ell' = \log t + 4$, the following holds:

(a) each of the $\log t + 4$ vertices of the palette clique receives a unique color,

(b) for each $i \in [\log t]$ the vertices $q^i_0$ and $q^i_1$ receive different colors (since they are
adjacent); one of them must take the color of $w$ and the other of $p_i$ (they are
adjacent to all other vertices of the palette),

(c) the triangle vertices $T'$ are colored using the colors of $x, y, z$ (they are adjacent to
all other vertices of the palette),

(d) the only colors which can occur on a vertex in $X_i$ (for all $i \in [t]$) are the colors given
to $x, y, z$ and $\{p_j \mid j \in [\log t]\}$ (since the vertices in $X_i$ are adjacent to $w$).

(II) For every $i \in [t]$, the graph $G'[X_i \cup T']$ is isomorphic to $G_i$.

(III) The set $Z' := \{p_i \mid i \in [\log t]\} \cup \{w, x, y, z\} \cup T' \cup \{q^i_0, q^i_1 \mid i \in [\log t]\}$ forms a vertex cover
of $G'$ of size $k' = |Z'| = 3\log t + 4 + 3m$. Hence we establish that $G'$ has a vertex cover
of size $O(m + \log t)$.
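The seven construction steps can be sketched as follows. The input encoding is an assumption made for this illustration: instance $i$ is a set of pairs `(r, (letter, j))` stating that the $r$-th vertex of $X_i$ is adjacent to the triangle vertex `letter`$_j$, so that the identification of Step 2 is implicit in the shared labels. All vertex labels are hypothetical names.

```python
from itertools import combinations

def cross_compose_coloring(instances, n, m):
    """Build (adj, Z', ell', k') from t triangle split instances, t a power of two.

    instances[i]: set of (r, (letter, j)) with r in range(n), letter in 'abc',
    j in 1..m, meaning vertex r of X_{i+1} is adjacent to triangle vertex letter_j.
    """
    t = len(instances)
    bits = t.bit_length() - 1
    assert t == 1 << bits, "the number of instances must be a power of two"
    adj = {}

    def add_edge(a, b):
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)

    # Steps 1-2: the merged triangles T' and the sets X_i.
    T = [(letter, j) for j in range(1, m + 1) for letter in "abc"]
    for j in range(1, m + 1):
        add_edge(("a", j), ("b", j))
        add_edge(("b", j), ("c", j))
        add_edge(("a", j), ("c", j))
    X = {i: [("x", i, r) for r in range(n)] for i in range(1, t + 1)}
    for i, inst in enumerate(instances, start=1):
        for r, tv in inst:
            add_edge(("x", i, r), tv)
    # Step 3: the palette clique.
    P = [("p", j) for j in range(1, bits + 1)]
    palette = P + ["w", "x", "y", "z"]
    for a, b in combinations(palette, 2):
        add_edge(a, b)
    # Step 4: T' sees the palette except x, y, z.
    for tv in T:
        for c in P + ["w"]:
            add_edge(tv, c)
    # Step 5: every X_i sees w.
    for i in X:
        for v in X[i]:
            add_edge(v, "w")
    # Step 6: instance selector pairs.
    selectors = []
    for j in range(1, bits + 1):
        q0, q1 = ("q", j, 0), ("q", j, 1)
        selectors += [q0, q1]
        add_edge(q0, q1)
        for c in palette:
            if c != ("p", j) and c != "w":
                add_edge(q0, c)
                add_edge(q1, c)
    # Step 7: connect X_i according to the bits of i (t itself maps to all zeros).
    for i in range(1, t + 1):
        code = format(i % t, "0{}b".format(bits))
        for j in range(1, bits + 1):
            q = ("q", j, int(code[j - 1]))
            for v in X[i]:
                add_edge(q, v)
    Z = set(palette) | set(T) | set(selectors)
    return adj, Z, bits + 4, len(Z)
```

For two single-triangle inputs the output has twelve vertices, small enough to confirm claim (III) and the OR behaviour with a backtracking coloring search.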

The output of the cross-composition is the instance $(G', Z', \ell' := \log t + 4, k')$. It is easy to verify
that the construction can be performed in polynomial time. As the value of $k'$ is polynomial
in $\log t$ plus the size of the largest input, it remains to prove that the output indeed acts as
the logical OR of the input instances: $\chi(G') \le \log t + 4$ if and only if there is an $i \in [t]$ such
that $\chi(G_i) \le 3$.

($\Rightarrow$) Suppose $\chi(G') \le \ell'$ and consider some proper $\ell'$-coloring of $G'$. By (I.b) we know that
for each $i \in [\log t]$ exactly one vertex of the pair $\{q^i_0, q^i_1\}$ receives the same color as $p_i$. Consider
the string of $\log t$ bits where the $i$-th most significant bit is a 1 if and only if vertex $q^i_1$ receives
the same color as $p_i$. This bitstring encodes some integer $i^* \in [t]$. We focus on the instance with
the number $i^*$. Let $Q$ be the set of vertices which contains for each pair $\{q^i_0, q^i_1\}$ ($i \in [\log t]$)
the unique vertex which is colored the same as $p_i$. By the definition of $G'$ we know that all
vertices of $X_{i^*}$ are adjacent to all vertices of $Q$; hence in any proper coloring of $G'$ the vertices
of $X_{i^*}$ cannot use any colors which are used on $\{p_i \mid i \in [\log t]\}$. By (I.d) this implies that the
coloring for $G'$ can only use the colors of $x, y, z$ on the vertices of $X_{i^*}$. By (I.c) the triangle
vertices $T'$ are also colored using only the colors of $x, y, z$. The graph $G'[X_{i^*} \cup T']$ is isomorphic
to the input graph $G_{i^*}$ by (II), and since the coloring of $G'$ only uses the colors of $x, y, z$ on
these vertices, this shows that the coloring of $G'$ restricted to the induced subgraph $G'[X_{i^*} \cup T']$
is in fact a 3-coloring of graph $G_{i^*}$, which proves that $\chi(G_{i^*}) \le 3$ and establishes this direction
of the equivalence.

($\Leftarrow$) Suppose $\chi(G_{i^*}) \le 3$ for some $i^* \in [t]$. We will construct a proper $\ell'$-coloring of $G'$. Start
by giving all vertices of the palette different colors. By (II) the graph $G'[X_{i^*} \cup T']$ is isomorphic
to $G_{i^*}$. Re-label the colors in the 3-coloring of $G_{i^*}$ such that it uses the colors given to $\{x, y, z\}$
in our partial $\ell'$-coloring of $G'$. Give a vertex $v$ in the induced subgraph $G'[X_{i^*} \cup T']$ the same
color as the vertex in $G_{i^*}$ that it is mapped to by the isomorphism. Afterwards we have a
proper partial $\ell'$-coloring, where all vertices of the palette, all vertices of $X_{i^*}$, and all triangle
vertices of $G'$ are colored. It remains to color the sets $X_i$ for $i \ne i^*$, and the pairs $\{q^i_0, q^i_1\}$. For
each $i \in [\log t]$ we color the pair $\{q^i_0, q^i_1\}$ as follows: if the $i$-th most significant bit of the binary
representation of the number $i^*$ is a 1 then we color $q^i_1$ the same color as $p_i$ and we color $q^i_0$ as $w$;
if the bit is a 0 then we do it the other way around. It is straightforward to verify that we do
not create any monochromatic edges in this way. As the final step we have to color the sets $X_i$
for $i \ne i^*$; so consider some $i \in [t]$ with $i \ne i^*$. The binary representation of the number $i^*$
must differ from the binary representation of $i$ in at least one position; suppose they differ at
position $j$. The vertex of $\{q^j_0, q^j_1\}$ that matches the bit value of $i^*$ at position $j$ was colored the
same as $p_j$, hence the other vertex of the pair must have been colored the same as $w$. Since the
bit values differ, by the definition of adjacencies in $G'$ we find that the vertices $X_i$ are adjacent
to the vertex of $\{q^j_0, q^j_1\}$ that is colored as $w$. Therefore the vertices of $X_i$ do not have any
neighbors colored as $p_j$, and since $X_i$ is an independent set we may color all vertices in it the
same as $p_j$. If we color all sets $X_i$ for $i \ne i^*$ in this way we obtain a proper $\ell'$-coloring of $G'$
which proves that $\chi(G') \le \ell'$. □

Theorem 8 shows that Chromatic Number parameterized by Vertex Cover is unlikely
to admit a polynomial kernel. If we treat the number of colors as a constant $q$, rather than
a variable, then the problem does admit a polynomial kernel: $q$-Coloring parameterized
by Vertex Cover has a kernel with $O(k^q)$ vertices for every fixed $q$ [37]. The lower bound
shows that $q$ must appear in the degree of the polynomial for such kernels.

By interpreting Chromatic Number on $\overline{G}$ as the problem of partitioning the vertex set
of $G$ into a minimum number of cliques, we obtain the following corollary to Theorem 8; note
that if $Z \subseteq V(G)$ is a vertex cover of $\overline{G}$, then $G - Z$ is a clique.

Corollary 5. Vertex Partition Into Cliques [GT15] parameterized by vertex-deletion
distance to a clique does not admit a polynomial kernel unless NP $\subseteq$ coNP/poly.

4.4 Feedback Vertex Set and Odd Cycle Transversal 

The problems Feedback Vertex Set (FVS) and Odd Cycle Transversal (OCT) ask
whether a given graph contains a small set of vertices that intersects all cycles, respectively 
all odd cycles. The study of their natural parameterization by the solution size has led to 
important advances in the design of FPT algorithms and kernelizations. Both problems are 
fixed-parameter tractable [481 133 and admit (randomized) polynomial kernels [48 1 I42j . We 
consider the kernelization complexity of these problems from a different angle. First we study 
the non-standard parameter "deletion distance to a clique" and use a polynomial parameter 
transformation to transfer a lower bound result from the similarly parameterized vertex cover 
problem (Corollary [5]). Thus the two cycle transversal problems do not admit polynomial
kernels for this parameter, unless NP $\subseteq$ coNP/poly. Then we consider a generalization of the
problem where each vertex is assigned an integral weight, and we seek a cycle transversal of 
bounded total weight. Using cross-composition we prove that the resulting problems do not 
admit polynomial kernels when parameterized by the cardinality of a given vertex cover. 

4.4.1 Unweighted problems parameterized by distance to a clique

We give kernelization lower bounds for FVS and OCT under the following structural parameter- 
ization. 

Feedback Vertex Set parameterized by Clique Deletion Set
Instance: A graph $G$, a set $Z \subseteq V(G)$ such that $G - Z$ is a clique, and a positive
integer $\ell$.

Parameter: The size $k := |Z|$ of the deletion set.

Question: Is there a set $S \subseteq V(G)$ of at most $\ell$ vertices that intersects all cycles?

The problem Odd Cycle Transversal parameterized by Clique Deletion Set is defined
analogously, by asking for a set that intersects all odd cycles. It is easy to give FPT
algorithms for these two parameterizations. If $G$ is a graph in which the removal of $Z \subseteq V(G)$
leaves a clique, then an FVS or OCT in $G$ avoids at most two vertices of $V(G) \setminus Z$, as otherwise
$V(G) \setminus Z$ contains a triangle not intersected by the transversal. Hence there are $O(n^2)$ ways
in which a valid transversal may intersect $V(G) \setminus Z$. For each valid intersection with $V(G) \setminus Z$
we may simply try all subsets of $Z$ to see which gives the smallest transversal that is valid for
the entire graph, resulting in an algorithm of runtime $O(2^{|Z|} \cdot n^{O(1)})$.
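
The enumeration argument above can be sketched in a few lines of Python. This is only an illustrative brute-force rendering for FVS under stated assumptions: the graph is simple and given as a vertex list and an edge list, and the helper names `is_forest` and `min_fvs_with_clique_deletion_set` are hypothetical, not taken from the paper.

```python
from itertools import combinations

def is_forest(vertices, edges):
    """Return True if the subgraph induced by `vertices` is acyclic (union-find)."""
    parent = {v: v for v in vertices}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    for u, v in edges:
        if u in parent and v in parent:
            ru, rv = find(u), find(v)
            if ru == rv:  # u and v are already connected, so this edge closes a cycle
                return False
            parent[ru] = rv
    return True

def min_fvs_with_clique_deletion_set(vertices, edges, Z):
    """Size of a smallest feedback vertex set, assuming G - Z is a clique."""
    Z = list(Z)
    clique = [v for v in vertices if v not in Z]
    best = len(vertices)  # deleting every vertex is always feasible
    # A solution avoids at most two clique vertices: O(n^2) choices.
    for kept_size in range(min(2, len(clique)) + 1):
        for kept in combinations(clique, kept_size):
            clique_part = set(clique) - set(kept)
            # Try all 2^{|Z|} subsets of the deletion set Z.
            for r in range(len(Z) + 1):
                for z_part in combinations(Z, r):
                    S = clique_part | set(z_part)
                    if len(S) < best and is_forest(set(vertices) - S, edges):
                        best = len(S)
    return best
```

Replacing the acyclicity test by a bipartiteness test would give the analogous sketch for OCT.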

Despite the simplicity of the algorithms, the problems are unlikely to admit polynomial 
kernels. The following theorem strengthens the results in a preliminary version of this work 
where we proved kernel lower bounds for FVS under the smaller parameterizations "vertex- 
deletion distance to a cluster graph" and "vertex-deletion distance to a co-cluster graph" [6]. 
The theorem also subsumes results from an extended abstract by a subset of the authors, 
where a kernel lower bound for OCT was given for parameterizations by (co-)cluster deletion 
distance [38].

Theorem 9. Feedback Vertex Set and Odd Cycle Transversal do not admit polynomial
kernels when parameterized by a clique deletion set, unless NP $\subseteq$ coNP/poly.

Proof. By Corollary U] and Proposition [2] it is sufficient to prove that there are polynomial-parameter
transformations from Vertex Cover parameterized by Clique Deletion Set
into FVS parameterized by Clique Deletion Set and OCT parameterized by Clique
Deletion Set. We give a single construction that works for both cases.

Let $(G, Z, \ell, k)$ be an instance of Vertex Cover parameterized by Clique Deletion
Set. Let us first argue that the edges of $G$ can be covered by a small number of cliques; this
will be essential for our transformation. Take the family of vertex subsets $\mathcal{Q}$ defined as follows:
$\mathcal{Q} := E(G[Z]) \cup \{V(G) \setminus Z\} \cup \{N_G[v] \setminus (Z \setminus \{v\}) \mid v \in Z\}$. The endpoints of an edge
trivially form a two-vertex clique, the problem definition ensures that $G - Z$ is a clique, and each
vertex $v \in Z$ together with its neighborhood in the clique $G - Z$ also forms a clique: hence each subset
induces a clique in $G$. Since each edge of $G$ is either contained in $G[Z]$, contained in $G - Z$, or
connects a vertex in $Z$ to a vertex outside $Z$, the family of cliques covers all edges. It is easy
to see that $|\mathcal{Q}| \le \binom{k}{2} + 1 + k$ and hence we can cover the edges in the input instance by a
number of cliques which is polynomial in the parameter. The clique cover $\mathcal{Q}$ can be computed
in polynomial time. We use it to construct an output instance as follows.
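
As a rough illustration of the clique cover just described, the following Python sketch builds the three parts of the family (edges inside $G[Z]$, the clique $V(G) \setminus Z$, and one clique per vertex of $Z$ together with its neighbors outside $Z$) and provides two checkers. The function names and the list-based graph encoding are assumptions made for this example only.

```python
from itertools import combinations

def clique_edge_cover(vertices, edges, Z):
    """Return the family Q as a set of frozensets, assuming G - Z is a clique."""
    adj = {v: set() for v in vertices}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    Z = set(Z)
    cover = set()
    # Part 1: every edge with both endpoints in Z is a two-vertex clique.
    for u, v in edges:
        if u in Z and v in Z:
            cover.add(frozenset((u, v)))
    # Part 2: the clique G - Z itself.
    cover.add(frozenset(set(vertices) - Z))
    # Part 3: each vertex of Z together with its neighbors outside Z.
    for v in Z:
        cover.add(frozenset({v} | (adj[v] - Z)))
    return cover

def is_clique(edges, C):
    """True if every pair of vertices in C is joined by an edge."""
    edge_set = {frozenset(e) for e in edges}
    return all(frozenset(p) in edge_set for p in combinations(C, 2))

def covers_all_edges(edges, cover):
    """True if every edge lies inside some member of the cover."""
    return all(any(u in C and v in C for C in cover) for u, v in edges)
```

On a small instance one can check that every member of the family is a clique, that all edges are covered, and that the family has at most $\binom{k}{2} + 1 + k$ members.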

Initialize $G'$ as a copy of $G$. For each clique $C \in \mathcal{Q}$, add a vertex $v_C$ to $G'$ with $N_{G'}(v_C) := C$.
Let $Z'$ be the union of $Z$ and all the vertices $v_C$ added in this way. We find that $|Z'| =
|Z| + |\mathcal{Q}|$ and therefore $|Z'|$ is bounded polynomially in the input parameter $|Z|$; let $k' := |Z'|$.
As $G' - Z' = G - Z$, removal of the set $Z'$ from $G'$ results in a clique. We claim that when
interpreting the tuple $(G', Z', \ell, k')$ as an instance of FVS or OCT parameterized by a clique
deletion set, the answer to the instance is the same as for the input instance $(G, Z, \ell, k)$.
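
The transformation can be tried out on a toy instance. The sketch below is a hedged illustration, not the authors' code: it takes a precomputed edge clique cover as input, adds one vertex per clique, and uses exponential-time brute force (suitable only for tiny graphs) to compare the minimum vertex cover of $G$ with the minimum feedback vertex set of $G'$. All function names and the tuple labels for new vertices are hypothetical.

```python
from itertools import combinations

def add_clique_vertices(vertices, edges, Z, cover):
    """Build (V', E', Z'): one new vertex v_C per clique C, adjacent to all of C."""
    new_vertices, new_edges, new_Z = list(vertices), list(edges), list(Z)
    for index, clique in enumerate(cover):
        v_c = ("v_C", index)  # hypothetical label for the vertex added for this clique
        new_vertices.append(v_c)
        new_Z.append(v_c)
        new_edges.extend((v_c, u) for u in clique)
    return new_vertices, new_edges, new_Z

def is_forest(vertices, edges):
    """Return True if the subgraph induced by `vertices` is acyclic (union-find)."""
    parent = {v: v for v in vertices}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for u, v in edges:
        if u in parent and v in parent:
            ru, rv = find(u), find(v)
            if ru == rv:
                return False
            parent[ru] = rv
    return True

def smallest(vertices, is_solution):
    """Brute force: size of a smallest vertex subset accepted by `is_solution`."""
    for size in range(len(vertices) + 1):
        for S in combinations(vertices, size):
            if is_solution(set(S)):
                return size

def min_vertex_cover(vertices, edges):
    return smallest(vertices, lambda S: all(u in S or v in S for u, v in edges))

def min_fvs(vertices, edges):
    return smallest(vertices, lambda S: is_forest(set(vertices) - S, edges))
```

For a valid clique cover, the two minima are expected to coincide, which matches the claim that the answer is preserved for every budget $\ell$.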

Suppose that $(G, Z, \ell, k)$ is a YES-instance of Vertex Cover parameterized by Clique
Deletion Set, and let $S \subseteq V(G)$ be a vertex cover of $G$ of size at most $\ell$. By the definition of
a vertex cover, $G - S$ is an independent set. Since every subset $C \in \mathcal{Q}$ is a clique in $G'$, it follows
that $|C \setminus S| \le 1$ for all $C \in \mathcal{Q}$. Observe that we can obtain $G' - S$ from $G - S$ by introducing
the vertices $v_C$ for all $C \in \mathcal{Q}$. Starting from the graph $G - S$ without edges, and using the fact
that $|C \setminus S| \le 1$ for all $C \in \mathcal{Q}$, it is easy to see that adding the vertices $v_C$ to obtain $G' - S$
comes down to repeatedly adding vertices of degree at most one to an acyclic graph. As such
a process does not create cycles, $G' - S$ is a forest. Therefore $S$ is both an FVS and an OCT
for $G'$, of size at most $\ell$, which proves that $(G', Z', \ell, k')$ is a YES-instance for the two considered
problems.

For the reverse direction, suppose that there is a set $S' \subseteq V(G')$ of size at most $\ell$ whose
removal from $G'$ leaves a graph without odd cycles; if $(G', Z', \ell, k')$ is a YES-instance of FVS or
OCT then such a set exists. We first show that without loss of generality, we may assume $v_C \notin S'$
for all $C \in \mathcal{Q}$. To this end, assume that there is some $C^* \in \mathcal{Q}$ such that $v_{C^*} \in S'$. If $C^* \setminus S' = \emptyset$,
then $S' \setminus \{v_{C^*}\}$ is also an OCT for $G'$ as all neighbors of $v_{C^*}$ are contained in the deletion set.
If, on the other hand, there exists some $u \in C^* \setminus S'$, then we claim $S^* := (S' \setminus \{v_{C^*}\}) \cup \{u\}$ is
also an OCT in $G'$. Indeed, by construction of $G'$ we have $N_{G'}[v_{C^*}] \subseteq N_{G'}[u]$ and therefore we
can substitute $u$ for $v_{C^*}$ in any odd cycle in $G' - S^*$ to obtain an odd cycle in $G' - S'$. As the
latter graph does not contain odd cycles, it follows that $S^*$ is an OCT in $G'$. Since the replacement
process can be performed independently for each $C \in \mathcal{Q}$, we conclude that if $(G', Z', \ell, k')$ is a
YES-instance, then $G'$ has an OCT $S'$ of size at most $\ell$ that does not contain any vertex $v_C$
for $C \in \mathcal{Q}$. Now observe that such a set $S'$ forms a vertex cover of the graph $G$: if $G - S'$ contains
an edge $\{x, y\}$, then the edge clique cover $\mathcal{Q}$ contains a clique $C_{x,y} \supseteq \{x, y\}$. As $v_{C_{x,y}} \notin S'$ by
construction, this implies that $G' - S'$ contains a triangle on $x$, $y$ and $v_{C_{x,y}}$; a contradiction. This
establishes the correctness of the transformation. Since it can be carried out in polynomial
time, and the resulting parameter $|Z'|$ is bounded polynomially in the input parameter $|Z|$, the
theorem follows. □



4.4.2 Weighted problems parameterized by vertex cover 

In this section we consider the kernelization complexity of weighted versions of FVS and OCT, 
parameterized by the cardinality of a given vertex cover. Our results extend the line of research 
initiated by a subset of the authors, who showed that Weighted Vertex Cover does not 
admit a polynomial kernel under such a parameterization, unless NP $\subseteq$ coNP/poly [36]. The
cross-composition we present unifies a construction from a preliminary version of this work [6]
for FVS, with a construction for OCT which appeared in an extended abstract by a subset of the
authors [38]. Let us formally define the problems under consideration.

Weighted Feedback Vertex Set parameterized by Vertex Cover
Instance: A graph $G$, a vertex cover $Z \subseteq V(G)$, a weight function $w \colon V(G) \to \mathbb{N}$, and
a positive integer $\ell$.

Parameter: The size $k := |Z|$ of the vertex cover.

Question: Is there a set $S \subseteq V(G)$ that intersects all cycles in $G$ and satisfies
$\sum_{v \in S} w(v) \le \ell$?

The related problem Weighted Odd Cycle Transversal parameterized by Vertex 
Cover is defined in the natural way. The parameterized problems are easily seen to be fixed- 
parameter tractable by noting that the treewidth of the input graphs is bounded by k, which 
allows for a solution by standard dynamic programming techniques [9]. We start by presenting
a construction that will be the key to merging multiple input instances. 

Definition 12. For a graph $G$ define its inflation $\phi(G)$ by the following steps:

• Obtain graph $G'$ from $G$ by completing every edge into a triangle with a new vertex: for
each edge $e = \{u, v\} \in E(G)$, add a new vertex $v_e$ with $N_{G'}(v_e) := \{u, v\}$ to $G'$.

• Obtain graph $\phi(G)$ from $G'$ by replacing every edge $\{u, v\} \in E(G')$ by a path on two new
vertices: remove edge $\{u, v\}$ while adding vertices $p_1$ and $p_2$ with the edges $\{u, p_1\}$, $\{p_1, p_2\}$
and $\{p_2, v\}$.

Note that by this definition, the vertex set of $\phi(G)$ is a superset of $V(G)$; hence we may
take a subset $S \subseteq V(G)$ and consider the effect of removing $S$ from $\phi(G)$. As an example of
the inflation operation, note that the inflation $\phi(K_2)$ of a single edge yields a simple cycle on
nine vertices.
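
The two steps of Definition 12 translate directly into code. The following Python sketch is one possible rendering for simple graphs given as vertex and edge lists; the tuple labels `("v_e", u, v)`, `("p1", index)` and `("p2", index)` for new vertices, and the helper `is_simple_cycle`, are naming assumptions of this example.

```python
def inflate(vertices, edges):
    """Return (vertices, edges) of the inflation of the simple graph (vertices, edges)."""
    # Step 1: complete every edge e = {u, v} into a triangle with a new vertex v_e.
    tri_vertices = list(vertices)
    tri_edges = []
    for u, v in edges:
        v_e = ("v_e", u, v)
        tri_vertices.append(v_e)
        tri_edges += [(u, v), (u, v_e), (v, v_e)]
    # Step 2: replace every edge of the intermediate graph by a path on two new vertices.
    out_vertices = list(tri_vertices)
    out_edges = []
    for index, (a, b) in enumerate(tri_edges):
        p1, p2 = ("p1", index), ("p2", index)
        out_vertices += [p1, p2]
        out_edges += [(a, p1), (p1, p2), (p2, b)]
    return out_vertices, out_edges

def is_simple_cycle(vertices, edges):
    """True if the graph is connected and every vertex has degree exactly two."""
    adj = {v: set() for v in vertices}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    if any(len(nbrs) != 2 for nbrs in adj.values()):
        return False
    seen, stack = set(), [vertices[0]]
    while stack:
        v = stack.pop()
        if v not in seen:
            seen.add(v)
            stack.extend(adj[v] - seen)
    return len(seen) == len(vertices)
```

Calling `inflate([0, 1], [(0, 1)])` reproduces the example from the text: the result has nine vertices and nine edges and passes `is_simple_cycle`.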
Observation 1. For any graph $G$ and its inflation $\phi(G)$, the following holds:

• The graph $\phi(G) - V(G)$ is the disjoint union of $3|E(G)|$ copies of $P_2$.

• The set $V(G)$ is independent in $\phi(G)$.

There is a correspondence between vertex covers of $G$, and sets that form an FVS or OCT in
the inflation $\phi(G)$. This correspondence will enable us to cross-compose instances of Vertex
Cover into the cycle covering problems.

Lemma 3. For any graph $G$ and integer $\ell$, the following conditions are equivalent.

1. The graph $G$ has a vertex cover of size at most $\ell$.

2. The graph $\phi(G)$ has a feedback vertex set of size at most $\ell$.

3. The graph $\phi(G)$ has an odd cycle transversal of size at most $\ell$.

Proof. $(1 \Rightarrow 2)$ Suppose that $S \subseteq V(G)$ is a vertex cover of $G$ of size at most $\ell$, and consider an
arbitrary simple cycle $C$ in $\phi(G)$: we will show that $S$ contains a vertex on $C$. Let $G'$ be the
intermediate graph obtained from $G$ that is used in the construction of $\phi(G)$ in Definition 12. As
all edges were replaced by $P_2$'s when going from $G'$ to $\phi(G)$, it follows that $C$ can be transformed
into a cycle $C'$ in $G'$ by replacing such $P_2$'s by direct edges. Hence $C$ corresponds to a simple
cycle $C'$ in $G'$ such that $V(C') \subseteq V(C)$.

• If $C'$ contains a vertex $v_e$ that was introduced to complete an edge $e = \{u, v\}$ into a
triangle in the first step of Definition 12, then $v_e$ only has $u$ and $v$ as neighbors and
therefore $C'$ contains $u$ and $v$. This pair is connected by the edge $e$ in $G$. As $S$ is a vertex
cover of $G$ it contains one of the vertices $u$ and $v$, and therefore intersects the cycle $C'$.
Since $V(C') \subseteq V(C)$ the set $S$ intersects $C$ as well.

• If $C'$ does not contain any vertices $v_e$ used to complete edges into triangles, then $V(C') \subseteq
V(G)$ and hence the cycle $C'$ also exists in $G$. Now the vertex cover $S$ of $G$ contains at
least one vertex of $C'$, which implies that it intersects $C$.

Hence all cycles in $\phi(G)$ are intersected by $S$, which shows that $S$ is an FVS of size at most $\ell$
in $\phi(G)$.

$(2 \Rightarrow 3)$ An FVS of size at most $\ell$ in $\phi(G)$ is also an OCT: the former covers all cycles, and the
latter only has to cover odd cycles.

$(3 \Rightarrow 1)$ Suppose $S \subseteq V(\phi(G))$ is an OCT of $\phi(G)$, of size at most $\ell$. We first show that we
may assume $S \subseteq V(G)$. So assume $S$ contains a vertex $v \notin V(G)$, i.e., a vertex which was
introduced by the inflation operation to subdivide an edge or complete an edge into a triangle.
From Definition 12 it follows that $v$ has degree two in $\phi(G)$, and that by following a path of
consecutive degree-two vertices in $\phi(G)$ we arrive at a vertex $u$ in $V(G) \subseteq V(\phi(G))$. As all
cycles through $v$ contain the vertex $u$, any cycle intersected by $v$ is also intersected by $u$, and
therefore $(S \setminus \{v\}) \cup \{u\}$ is also an OCT of $\phi(G)$. By repeating this step we obtain a set $S \subseteq V(G)$
of size at most $\ell$ which intersects the odd cycles of $\phi(G)$. We claim that this $S$ is a vertex cover
in $G$.

Assume for a contradiction that there is an edge $e = \{u, v\}$ in $G$ for which no endpoint is
contained in $S$. The graph $G'$ used in the construction of $\phi(G)$ has a vertex $v_e$ adjacent to $u$
and $v$, and hence contains a triangle $C' := (u, v_e, v)$. When creating $\phi(G)$ from $G'$, each edge on
this triangle is subdivided by two new vertices, resulting in an odd cycle $C$ in $\phi(G)$ on $3 + 6 = 9$
vertices. Hence $\phi(G)$ contains an odd cycle $C$ such that $V(C) \cap V(G) = \{u, v\}$. As $S$ contains
neither $u$ nor $v$ by assumption, $\phi(G) - S$ contains an odd cycle, which implies that $S$ is not an
OCT for $\phi(G)$; a contradiction. □
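
The equivalence in Lemma 3 can be sanity-checked by brute force on very small graphs. The sketch below is only an experiment under stated assumptions: graphs are simple and given as lists, all function names are hypothetical, and the exhaustive search is exponential, so it is meant for graphs with a handful of vertices.

```python
from itertools import combinations

def inflate(vertices, edges):
    """Inflation as in Definition 12, in compact form (labels are illustrative)."""
    out_vertices, tri_edges = list(vertices), []
    for u, v in edges:
        v_e = ("v_e", u, v)
        out_vertices.append(v_e)
        tri_edges += [(u, v), (u, v_e), (v, v_e)]
    out_edges = []
    for index, (a, b) in enumerate(tri_edges):
        p1, p2 = ("p1", index), ("p2", index)
        out_vertices += [p1, p2]
        out_edges += [(a, p1), (p1, p2), (p2, b)]
    return out_vertices, out_edges

def has_no_cycle(vertices, edges, odd_only):
    """Check the subgraph induced by `vertices` for cycles (or only odd cycles).

    With odd_only=True this is a bipartiteness test by 2-coloring,
    otherwise it is a forest test.
    """
    adj = {v: [] for v in vertices}
    for u, v in edges:
        if u in adj and v in adj:
            adj[u].append(v)
            adj[v].append(u)
    color = {}
    for root in adj:
        if root in color:
            continue
        color[root] = 0
        stack = [(root, None)]
        while stack:
            v, parent = stack.pop()
            for w in adj[v]:
                if w not in color:
                    color[w] = 1 - color[v]
                    stack.append((w, v))
                elif w != parent and (not odd_only or color[w] == color[v]):
                    return False
    return True

def min_vertex_cover(vertices, edges):
    for size in range(len(vertices) + 1):
        for S in combinations(vertices, size):
            if all(u in S or v in S for u, v in edges):
                return size

def min_transversal(vertices, edges, odd_only):
    """Smallest FVS (odd_only=False) or OCT (odd_only=True) by exhaustive search."""
    for size in range(len(vertices) + 1):
        for S in combinations(vertices, size):
            if has_no_cycle(set(vertices) - set(S), edges, odd_only):
                return size
```

For a graph `(V, E)` with inflation `(PV, PE)`, the lemma predicts that `min_vertex_cover(V, E)`, `min_transversal(PV, PE, False)` and `min_transversal(PV, PE, True)` all agree.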
Figure 2: The $K_4$-in-a-box graph $B_{K_4}$ with labeled vertices.

Before presenting the cross-composition we also introduce the following gadget, which will
be used as a bit selector.

Definition 13. The $K_4$-in-a-box graph $B_{K_4}$ (see Figure 2) is the graph obtained from a clique on
four vertices $\{a, b, c, d\}$ by adding a new degree-2 vertex $v$ for each pair $\{a, b\}, \{b, c\}, \{c, d\}, \{d, a\}$
such that $v$ is adjacent to both vertices of the pair. The vertices $\{a, c\}$ are the 0-labeled terminals
of the graph, and the vertices $\{b, d\}$ are the 1-labeled terminals of the graph.

Observation 2. Any vertex set that covers all triangles of $B_{K_4}$ has size at least two. Every
size-2 vertex set covering all triangles either consists of the 0-labeled terminals or the 1-labeled
terminals.
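
Since the gadget has only eight vertices, Observation 2 can be confirmed by exhaustive enumeration. The following Python sketch does this; the vertex names `v_ab`, `v_bc`, `v_cd`, `v_da` for the degree-2 vertices and the function names are assumptions of this illustration.

```python
from itertools import combinations

def k4_in_a_box():
    """Build the gadget: a clique on a, b, c, d plus four degree-2 vertices."""
    core = ["a", "b", "c", "d"]
    vertices = list(core)
    edges = {frozenset(pair) for pair in combinations(core, 2)}
    for u, v in [("a", "b"), ("b", "c"), ("c", "d"), ("d", "a")]:
        w = "v_" + u + v  # new vertex adjacent to both vertices of the pair
        vertices.append(w)
        edges |= {frozenset((w, u)), frozenset((w, v))}
    return vertices, edges

def triangles(vertices, edges):
    """All vertex triples that are pairwise adjacent."""
    return [set(t) for t in combinations(vertices, 3)
            if all(frozenset(p) in edges for p in combinations(t, 2))]

def triangle_covers(vertices, edges, size):
    """All vertex sets of the given size that intersect every triangle."""
    tris = triangles(vertices, edges)
    return [set(S) for S in combinations(vertices, size)
            if all(t & set(S) for t in tris)]
```

Enumerating covers of size 0, 1 and 2 shows that no set of fewer than two vertices hits all triangles, and that the only covers of size two are the two terminal pairs.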

We present a single construction that can serve as a cross-composition of the classical Vertex
Cover problem into both cycle covering problems considered in this section. Recall that
an instance of the NP-complete [3Ul GTl] Vertex Cover problem is a tuple $(G, \ell)$ and asks
whether $G$ has a vertex cover of size at most $\ell$.

Theorem 10. Weighted Feedback Vertex Set and Weighted Odd Cycle Transversal
do not admit polynomial kernels when parameterized by vertex cover, unless NP $\subseteq$ coNP/poly.

Proof. We prove the theorem by showing that Vertex Cover cross-composes into both Weighted
Feedback Vertex Set and Weighted Odd Cycle Transversal under the chosen parameterization.
The same cross-composition works for both problems. By a suitable choice of polynomial
equivalence relation we may assume the input consists of well-formed instances $(G_1, \ell), \ldots, (G_t, \ell)$
of Vertex Cover that all have the same number of vertices, say $n$, and edges, say $m$,
and that have the same target value $\ell$. By the argument given in the proof of Theorem 8 we
may assume that $t$ is a power of 2. As a graph on $n$ vertices trivially has a vertex cover of size $n$,
we may output a constant-size YES-instance if $\ell \ge n$ and assume that $\ell < n$ in the remainder
of the proof.

We will merge the inflated versions of the input graphs to obtain the desired cross-composition. 
By Observation 1 it holds for each input $G_i$ that $\phi(G_i) - V(G_i)$ is a disjoint union of $3m$ copies
of $P_2$. Label the $P_2$'s in $\phi(G_i) - V(G_i)$ arbitrarily from 1 to $3m$, and assign each $P_2$ a left and
right endpoint. Construct a graph $G'$ with weight function $w'$ as follows.

1. Initialize $G'$ as the disjoint union of the graphs $\phi(G_i)$ for $i \in [t]$.

2. For each $j \in [3m]$, consider the $j$-th copy of $P_2$ in each inflated input graph, that is, in
$\phi(G_i) - V(G_i)$ for every $i \in [t]$. Modify the graph $G'$ by identifying all the left endpoints of
the $j$-th $P_2$ into a single vertex. Similarly identify all the right endpoints of the $j$-th $P_2$ into a
single vertex. Let $A' \subseteq V(G')$ contain the resulting merged vertices. Observe that as we merged
$P_2$'s one by one, the graph $G'[A']$ is a disjoint union of $P_2$'s and therefore $G'[V(G_i) \cup A']$ is
isomorphic to $\phi(G_i)$ for all $i \in [t]$.



3. We can represent an instance number in the range $[t]$ using exactly $\log t$ bits since we
assumed $t$ is a power of 2. For each bit position $j \in [\log t]$ we create a copy of the
graph $B_{K_4}$ described in Definition 13. We label its 0-terminal vertices $\{b_{j,0'}, b_{j,0''}\}$ and
the 1-terminal vertices $\{b_{j,1'}, b_{j,1''}\}$. For each instance number $i$ whose $j$-th most significant
bit in the binary expansion is a 0, we make all vertices of $V(G_i)$ in $G'$ adjacent to the 0-terminals
$\{b_{j,0'}, b_{j,0''}\}$, and for instance numbers whose bit value is 1 we make them adjacent
to $\{b_{j,1'}, b_{j,1''}\}$.

4. Assign each vertex corresponding to a copy of $B_{K_4}$ a weight of $t \cdot n$, and give all other
vertices weight 1.
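
The bit-selector wiring of step 3 and the weights of step 4 can be illustrated with a short Python sketch. It assumes, for convenience only, that instances are numbered $0, \ldots, t-1$ so that each number has a $\log t$-bit binary expansion; terminals are encoded as hypothetical triples `(j, bit, side)`, and the budget formula is the value $2(\log t)\, t \cdot n + (t-1)n + \ell$ used in the remainder of the proof.

```python
def bit(i, j, num_bits):
    """Return the j-th most significant bit (j = 1..num_bits) of the number i."""
    return (i >> (num_bits - j)) & 1

def selector_terminals(i, num_bits):
    """Terminals b_{j,c'} and b_{j,c''} to which all vertices of V(G_i) are attached."""
    return {(j, bit(i, j, num_bits), side)
            for j in range(1, num_bits + 1)
            for side in ("'", "''")}

def weight_budget(t, n, ell):
    """Budget 2 (log t) t n + (t - 1) n + ell, for t a power of two."""
    num_bits = t.bit_length() - 1  # equals log t when t is a power of two
    return 2 * num_bits * t * n + (t - 1) * n + ell
```

If a solution picks exactly the terminals `selector_terminals(i_star, num_bits)`, then instance `i_star` is the only one whose selector neighbors are all deleted; every other instance keeps an adjacent pair of terminals in at least one bit position.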

This concludes the description of the graph $G'$ and weight function $w'$. Since a valid instance of
Weighted Feedback Vertex Set parameterized by vertex cover also contains a vertex cover
set $Z'$, we must supply such a vertex cover as part of the output of the procedure. Observation 1
shows that for each input $G_i$, the set $V(G_i)$ is independent in $\phi(G_i)$. As $G'$ contains the disjoint
union of the inflated input graphs, and since merging the $P_2$'s does not introduce edges in
the set $\bigcup_{i=1}^{t} V(G_i)$, it follows that a vertex cover $Z'$ of $G'$ can be formed by taking the union
of $A'$ and all vertices of each of the $\log t$ copies of $B_{K_4}$. The size of this vertex cover $Z'$
is $|A'| + 8 \log t = 3m + 8 \log t$. The parameter value $k' := |Z'|$ is bounded by a polynomial
in $\log t$ plus the size of the largest instance. We set $\ell' := 2(\log t)\, t \cdot n + (t - 1)n + \ell$. It is easy
to see that this construction can be carried out in polynomial time. The constructed tuple can
serve as an instance of Weighted FVS parameterized by Vertex Cover or Weighted
OCT parameterized by Vertex Cover; we will prove that this makes no difference for
the truth value. We therefore conclude the proof by showing that the following conditions are
equivalent:

1. The graph $G'$ has a feedback vertex set of weight at most $\ell'$.

2. The graph $G'$ has an odd cycle transversal of weight at most $\ell'$.

3. There is an input $i^* \in [t]$ such that $G_{i^*}$ has a vertex cover of size at most $\ell$.

Clearly the equivalence of these conditions shows that the construction serves as a
cross-composition of Vertex Cover into both problems, which establishes the kernelization lower
bounds by Corollary [2].

$(1 \Rightarrow 2)$ Any feedback vertex set is also an odd cycle transversal of the same size.

$(2 \Rightarrow 3)$ Assume that $G'$ has an OCT $S'$ of total weight at most $2(\log t)\, t \cdot n + (t - 1)n + \ell$. The
graph $G'$ contains $\log t$ vertex-disjoint copies of the graph $B_{K_4}$. By Observation 2 at least two
vertices are needed to cover all the triangles in $B_{K_4}$, hence the transversal $S'$ contains at least
two vertices from each copy of $B_{K_4}$. If there is some copy of $B_{K_4}$ from which $S'$ contains more
than two vertices, then as each vertex in a copy of $B_{K_4}$ has weight $t \cdot n$ the set $S'$ has weight
at least $(2(\log t) + 1)\, t \cdot n$, which is strictly greater than $\ell'$ as $\ell < n$. Therefore any OCT $S'$ of
weight at most $\ell'$ contains exactly two vertices from each copy of $B_{K_4}$. From Observation 2
it then follows that for each copy of $B_{K_4}$ the set $S'$ contains either the 0-terminal vertices or
the 1-terminal vertices, as no other pair of vertices can cover all triangles. We now construct
the binary representation of an instance number using the contents of $S'$. Let the $j$-th bit of the
number be a 1 if the set $S'$ contains $\{b_{j,1'}, b_{j,1''}\}$ and a 0 in the case that it contains $\{b_{j,0'}, b_{j,0''}\}$;
let $i^*$ denote the instance number in the range $[t]$ that is represented by this bitstring.
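
The weight comparison used above can be written out as a short chain. If $S'$ contains at least three vertices of one copy of $B_{K_4}$ and at least two vertices of every other copy, then, using $t \cdot n = (t-1)n + n$ and the assumption $\ell < n$:

```latex
\begin{align*}
w'(S') &\ge (2\log t + 1)\, t \cdot n \\
       &= 2(\log t)\, t \cdot n + (t-1)n + n \\
       &> 2(\log t)\, t \cdot n + (t-1)n + \ell = \ell'.
\end{align*}
```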

Observe that by the choice of $i^*$, for all vertices in $V(G_{i^*})$ all of their neighbors in the $B_{K_4}$
graphs are contained in $S'$. On the other hand, if we consider some instance number $i' \ne i^*$ then
there is at least one bit position where the representations of the numbers $i'$ and $i^*$ differ. Let $j$
be such a bit position and assume for the moment that the $j$-th bit of the number $i^*$ is a 1, which
implies that the $j$-th bit of $i'$ is a 0 (the other case is symmetric). Then $S'$ contains the terminal
vertices $\{b_{j,1'}, b_{j,1''}\}$ but does not contain $\{b_{j,0'}, b_{j,0''}\}$. But then $S'$ must contain all vertices
from the set $V(G_{i'})$, for if $S'$ avoided some vertex $v \in V(G_{i'})$ then the graph $G' - S'$
would contain a triangle on vertices $\{v, b_{j,0'}, b_{j,0''}\}$, which contradicts the assumption that $S'$
is an OCT. This shows that for all instance numbers $i' \ne i^*$ the set $S'$ contains all vertices
of $V(G_{i'})$. These vertices together with the two terminal vertices in each copy of $B_{K_4}$ account
for $2(\log t)\, t \cdot n + (t - 1)n$ of the weight of $S'$, and therefore the remaining vertices in $S'$ have
weight at most $\ell$; in particular $S'$ contains at most $\ell$ vertices from the set $V(G_{i^*}) \cup A'$ since
each such vertex has weight 1. Choose $S'_{i^*}$ as $S' \cap (V(G_{i^*}) \cup A')$; the preceding argument
shows $|S'_{i^*}| \le \ell$. Since $S'$ is an OCT for $G'$ it breaks all odd cycles in all induced subgraphs,
hence $S'_{i^*}$ is an OCT of $G'[V(G_{i^*}) \cup A']$. During the construction of $G'$ we already observed that
the graph $G'[V(G_{i^*}) \cup A']$ is isomorphic to $\phi(G_{i^*})$, which together with the previous observation
shows that $\phi(G_{i^*})$ has an OCT of size at most $|S'_{i^*}| \le \ell$. By Lemma 3 this implies that $G_{i^*}$
has a vertex cover of size at most $\ell$, thereby proving that $(G_{i^*}, \ell)$ is a YES-instance of Vertex
Cover.

$(3 \Rightarrow 1)$ For the reverse direction, assume that one of the input instances, say $(G_{i^*}, \ell)$, is a
YES-instance of Vertex Cover, i.e., that $G_{i^*}$ has a vertex cover of size at most $\ell$. By Lemma 3 this
implies that $\phi(G_{i^*})$ has an FVS of size at most $\ell$. We construct an FVS $S'$ in $G'$ of weight at
most $\ell'$.

1. For each bit position $j \in [\log t]$ add $\{b_{j,0'}, b_{j,0''}\}$ to $S'$ if the $j$-th bit of $i^*$ is a 0, and
otherwise add $\{b_{j,1'}, b_{j,1''}\}$. This contributes a total weight of $2(\log t)\, t \cdot n$.

2. Add the set $\bigcup_{i \ne i^*} V(G_i)$ to $S'$ for a total weight addition of $(t - 1)n$.

3. By the construction of $G'$, the graph $G'[V(G_{i^*}) \cup A']$ is isomorphic to $\phi(G_{i^*})$, and by
assumption this graph has a feedback vertex set of size at most $\ell$. Add an FVS of $G'[V(G_{i^*}) \cup A']$
of size at most $\ell$ to $S'$; as the vertices in this subgraph have a weight of 1 under $w'$,
this increases the weight of $S'$ by at most $\ell$.

Summing up the weight increases we find that the resulting set $S'$ has weight at most $\ell'$. To see
that $S'$ indeed intersects all cycles in $G'$, observe that by taking two matching terminal vertices
for each copy of $B_{K_4}$ we have broken all cycles within the $B_{K_4}$ graphs. For all sets $V(G_{i'})$
with $i' \ne i^*$ we have taken all the $V(G_{i'})$ vertices into $S'$ so $G' - S'$ cannot contain cycles
through such sets $V(G_{i'})$. By taking the appropriate terminal vertices in $S'$ we have broken all
connections between vertices in $V(G_{i^*})$ and vertices in copies of $B_{K_4}$. Finally there can be no
cycles in $G'[V(G_{i^*}) \cup A'] - S'$ since we have added an FVS for this graph to $S'$. Hence $S'$ indeed
covers all cycles, and has the desired weight. □

Slight variations of the proof of Theorem 10 can be used to show kernel lower bounds
for various other weighted vertex-deletion problems parameterized by vertex cover, such as
Chordal Deletion and Planar Deletion. Observe that the weight function used in the
construction is very simple: it uses only two possible weight values, namely 1 and $t \cdot n$, both of
which are polynomially bounded in the total input size to the cross-composition algorithm. The
theorem shows that when it comes to weighted problems, even graphs with a simple structure
(i.e., graphs with a small vertex cover) can encode the logical OR of many input instances.

5 Conclusions 

We have introduced the cross-composition framework and used it to derive kernelization lower
bounds for structural parameterizations of several graph problems. Through its many applications
[HElIIlIISlIISlIIIlIiaEIllMlES], the framework has already proven itself to be a
fruitful tool in the study of kernelization complexity. The key to many cross-compositions is
choosing a convenient NP-hard source problem. As the examples in Section 4 show, a series of
input graphs $(G_1, \ldots, G_t)$ may be amenable to composition if the vertex set of each graph can
be partitioned into $X_i \cup Y_i$ such that all graphs $G_i[X_i]$ are isomorphic, and all graphs $G_i[Y_i]$ are
isomorphic. In such settings we may merge the graphs by identifying all subgraphs $G_i[X_i]$ into
a single canonical copy. Our construction for Chromatic Number employs graphs which decompose
into an independent set and disjoint triangles, whereas the inflation $\phi(G)$ of a graph $G$,
which underlies the construction of Theorem 10, can be decomposed into an independent set
and a disjoint union of $P_2$'s. The importance of starting from a convenient source problem when
proving kernel lower bounds can also be observed in the recent work of Dell and Marx [20] and
Cygan et al. [TB].

Two recent papers [39\ HQ] extend the cross-composition framework by the use of
co-nondeterministic (cross-)compositions. The full version of [SS] contains the natural variant
of defining a co-nondeterministic OR-cross-composition and a formal proof for the fact that it
excludes polynomial kernels and compressions when the dependence on $t$ is at most $f(t) = t^{o(1)}$;
the same follows immediately regarding polynomial lower bounds when $f(t) = t^{1/d + o(1)}$. Note
that a similar result for AND-cross-compositions is highly unlikely; it is an easy exercise to
construct a co-nondeterministic AND-cross-composition for any language. Furthermore, Kratsch et
al. [iQ] point out that OR-cross-compositions do not need NP-hard source problems, but that
NP-hardness under co-nondeterministic many-one reductions suffices. To see this, note that
a cross-composition of $L \subseteq \Sigma^*$ into $Q \subseteq \Sigma^* \times \mathbb{N}$ together with a polynomial kernel
or compression for $Q$ implies $L \in$ coNP/poly. If $L$ is (only) NP-hard under coNP-many-one
reductions then this still implies NP $\subseteq$ coNP/poly. (This of course holds for any lower bound
technique that produces the consequence that $L \in$ coNP/poly.)

We hope to see many applications of cross-composition in the future. 

Acknowledgments. We would like to thank Holger Dell for insightful discussions. 